Follistatin-related fusion proteins and uses thereof

ABSTRACT

In certain aspects, the present disclosure provides compositions and methods for inhibiting activity of TGFβ superfamily ligands, particularly ligands such as GDF8, GDF11, activin A, activin B, activin C and activin E, in vertebrates, including rodents and primates, and particularly in humans. In some embodiments, the compositions of the disclosure may be used to treat or prevent diseases or disorders that are associated with abnormal activity of a follistatin-related polypeptide and/or a follistatin ligand.

RELATED APPLICATIONS

This application claims the benefit of U.S. Provisional Application No.62/138,886, filed Mar. 26, 2015 (now pending). The content anddisclosure of the foregoing application are incorporated herein byreference in their entirety.

BACKGROUND OF THE INVENTION

The transforming growth factor-β (TGFβ) superfamily contains a varietyof growth factors that share common sequence elements and structuralmotifs. These proteins are known to exert biological effects on a largevariety of cell types in both vertebrates and invertebrates. Members ofthe superfamily perform important functions during embryonic developmentin pattern formation and tissue specification and can influence avariety of differentiation processes, including adipogenesis,myogenesis, chondrogenesis, cardiogenesis, hematopoiesis, neurogenesis,and epithelial cell differentiation. Superfamily members have diverse,often complementary effects. By manipulating the activity of a member ofthe TGFβ family, it is often possible to cause significant physiologicalchanges in an organism. Changes in muscle, bone, cartilage and othertissues may be achieved by increasing or antagonizing signaling that ismediated by an appropriate TGFβ superfamily member.

Naturally occurring proteins often referred to as ligand traps functionas extracellular regulators of TGFβ superfamily ligands. Such ligandtraps act either in soluble form or attached to the extracellular matrixand typically sequester ligand by binding to epitopes required forreceptor activation. One family of ligand traps includes follistatin andfollistatin-related proteins, which possess desirable functionalactivity based on multiple lines of evidence but have proven difficultto use as therapeutic agents. Thus, there is a need for such agents thatfunction as potent regulators of TGFβ superfamily signaling.

SUMMARY OF THE INVENTION

In part, the disclosure provides heteromeric protein complexes thatcomprise a follistatin-related fusion protein. Such heteromeric proteincomplexes optionally exhibit high affinity binding and inhibition ofligands, such as activin A, activin B, activin C, activin E, GDF8, andGDF11, and, optionally, exhibit improved production in recombinant celllines, improved properties for purification and/or extended serumhalf-life relative to native forms of follistatin-related proteins.

In certain aspects, protein complexes described herein comprise a firstpolypeptide covalently or non-covalently associated with a secondpolypeptide wherein the first polypeptide comprises the amino acidsequence of a follistatin-related polypeptide and the amino acidsequence of a first member of an interaction pair and the secondpolypeptide comprises the amino acid sequence of a second member of theinteraction pair. In other aspects, protein complexes described hereincomprise a first polypeptide non-covalently associated with a secondpolypeptide wherein the first polypeptide comprises the amino acidsequence of a follistatin-related polypeptide and the amino acidsequence of a first member of an interaction pair and the secondpolypeptide comprises the amino acid sequence of a second member of theinteraction pair.

Follistatin-related polypeptides described herein include polypeptidescomprising, consisting essentially of, or consisting of an amino acidsequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%,97%, 98%, 99% or 100% to any one of SEQ ID Nos: 1-33. Optionally, thefollistatin-related polypeptide is connected directly to the firstmember of the interaction pair, or an intervening sequence, such as alinker, may be positioned between the amino acid sequence of thefollistatin-related polypeptide and the amino acid sequence of the firstmember of the interaction pair. Examples of linkers include thesequences TGGG, TGGGG, SGGGG, GGGGS, and GGG. Optionally, the firstpolypeptide may comprise additional amino acids (e.g., 1-50, 1-40, 1-30,1-20, or 1-10 amino acids) positioned C-terminal and/or N-terminal tothe first member of the interaction pair or the follistatin-relatedpolypeptide. Such additional amino acids positioned C-terminal orN-terminal to the first member of the interaction pair or thefollistatin-related polypeptide may confer a biological activity.Alternatively, such additional amino acids may confer no, orsubstantially no, biological activity. Optionally, such additional aminoacids are heterologous to the follistatin-related polypeptide andpreferably are not a follistatin-related polypeptide.

The second polypeptide may consist essentially of or consist of thesecond member of the interaction pair. Optionally, the secondpolypeptide may comprise additional amino acids (e.g., 1-50, 1-40, 1-30,1-20, or 1-10 amino acids) positioned C-terminal and/or N-terminal tothe second member of the interaction pair. Such additional amino acidspositioned C-terminal or N-terminal to the second member of theinteraction pair may confer a biological activity. Alternatively, suchadditional amino acids positioned C-terminal or N-terminal to the secondmember of the interaction pair may confer no, or substantially no,biological activity. Optionally, such additional amino acids positionedC-terminal or N-terminal to the second member of the interaction pairshould be heterologous to the follistatin-related polypeptide andpreferably are not a follistatin-related polypeptide.

Interaction pairs described herein are designed to promote dimerizationor form higher order multimers. In some embodiments, the interactionpair may be any two polypeptide sequences that interact to form acomplex, particularly a heterodimeric complex although operativeembodiments may also employ an interaction pair that forms a homodimericcomplex. The first and second members of the interaction pair may be anasymmetric pair, meaning that the members of the pair preferentiallyassociate with each other rather than self-associating. Accordingly,first and second members of an asymmetric interaction pair may associateto form a heterodimeric complex. Alternatively, the interaction pair maybe unguided, meaning that the members of the pair may associate witheach other or self-associate without substantial preference and thus mayhave the same or different amino acid sequences. Accordingly, first andsecond members of an unguided interaction pair may associate to form ahomodimeric complex or a heterodimeric complex. Optionally, the firstmember of the interaction action pair (e.g., an asymmetric pair or anunguided interaction pair) associates covalently with the second memberof the interaction pair. Optionally, the first member of the interactionaction pair (e.g., an asymmetric pair or an unguided interaction pair)associates non-covalently with the second member of the interactionpair.

Traditional Fc fusion proteins and antibodies are examples of unguidedinteraction pairs, whereas a variety of engineered Fc domains have beendesigned as asymmetric interaction pairs. Therefore, a first memberand/or a second member an interaction pair described herein may comprisea constant domain of an immunoglobulin, including, for example, the Fcportion of an immunoglobulin. Optionally, a first member of aninteraction pair may comprise an amino acid sequence that is derivedfrom an Fc domain of an IgG1, IgG2, IgG3, or IgG4 immunoglobulin of amammal, preferably a human. For example, the first member of aninteraction pair may comprise, consists essentially of, or consist of anamino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%,95%, 96%, 97%, 98%, 99% or 100° A to any one of SEQ ID Nos: 34-46.Optionally, a second member of an interaction pair may comprise an aminoacid sequence that is derived from an Fc domain of an IgG1, IgG2, IgG3,or IgG4. For example, the second member of an interaction pair maycomprise, consists essentially of, or consist of an amino acid sequencethat is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%,99% or 100° A to any one of SEQ ID Nos: 34-46. In some embodiments, afirst member and a second member of an interaction pair comprise Fcdomains derived from the same immunoglobulin class and subtype. In otherembodiments, a first member and a second member of an interaction paircomprise Fc domains derived from different immunoglobulin classes orsubtypes. Optionally, a first member and/or a second member of aninteraction pair (e.g., an asymmetric pair or an unguided interactionpair) comprise a modified constant domain of an immunoglobulin,including, for example, a modified Fc portion of an immunoglobulin. Forexample, protein complexes of the disclosure may comprise a firstmodified Fc portion of an IgG comprising an amino acid sequence that isat least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99° Aidentical to an amino acid sequence selected from the group: SEQ ID Nos34-46 and a second modified Fc portion of an IgG, which may be the sameor different from the amino acid sequence of the first modified Fcportion of the IgG, comprising an amino acid sequence that is at least80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99° Aidentical to an amino acid sequence selected from the group: SEQ ID Nos34-46.

In some embodiments, the first member of the interaction pair comprisesan amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%,94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acidsequence of SEQ ID NO: 39 and, optionally, the second member of theinteraction pair comprises an amino acid sequence that is at least 80%,85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100° Aidentical to the amino acid sequence of SEQ ID NO: 40. In otherembodiments, the first member of the interaction pair comprises an aminoacid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%,96%, 97%, 98%, 99%, or 100° A identical to the amino acid sequence ofSEQ ID NO: 40 and, optionally, the second member of the interaction paircomprises an amino acid sequence that is at least 80%, 85%, 90%, 91%,92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100° A identical to the aminoacid sequence of SEQ ID NO: 39.

In some embodiments, the first member of the interaction pair comprisesan amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%,94%, 95%, 96%, 97%, 98%, 99%, or 100° A identical to the amino acidsequence of SEQ ID NO: 41 and, optionally, the second member of theinteraction pair comprises an amino acid sequence that is at least 80%,85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100° Aidentical to the amino acid sequence of SEQ ID NO: 42. In otherembodiments, the first member of the interaction pair comprises an aminoacid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%,96%, 97%, 98%, 99%, or 100° A identical to the amino acid sequence ofSEQ ID NO: 42 and, optionally, the second member of the interaction paircomprises an amino acid sequence that is at least 80%, 85%, 90%, 91%,92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100° A identical to the aminoacid sequence of SEQ ID NO: 41.

In some embodiments, the first member of the interaction pair comprisesan amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%,94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acidsequence of SEQ ID NO: 43 and, optionally, the second member of theinteraction pair comprises an amino acid sequence that is at least 80%,85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100° Aidentical to the amino acid sequence of SEQ ID NO: 44. In otherembodiments, the first member of the interaction pair comprises an aminoacid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%,96%, 97%, 98%, 99%, or 100° A identical to the amino acid sequence ofSEQ ID NO: 44 and, optionally, the second member of the interaction paircomprises an amino acid sequence that is at least 80%, 85%, 90%, 91%,92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100° A identical to the aminoacid sequence of SEQ ID NO: 43.

In some embodiments, the first member of the interaction pair comprisesan amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%,94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acidsequence of SEQ ID NO: 45 and, optionally, the second member of theinteraction pair comprises an amino acid sequence that is at least 80%,85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100° Aidentical to the amino acid sequence of SEQ ID NO: 46. In otherembodiments, the first member of the interaction pair comprises an aminoacid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%,96%, 9′7%, 98%, 99%, or 100° A identical to the amino acid sequence ofSEQ ID NO: 46 and, optionally, the second member of the interaction paircomprises an amino acid sequence that is at least 80%, 85%, 90%, 91%,92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100° A identical to the aminoacid sequence of SEQ ID NO: 45.

In some embodiments, the disclosure provides a fusion polypeptidecomprising an amino acid sequence of a follistatin-related polypeptideand the amino acid sequence of a member of an asymmetric interactionpair. Such fusion proteins may comprise a follistatin-relatedpolypeptide comprising an amino acid sequence that is at least 80%, 85%,90%, 95%, 97%, 98%, 99%, or 100% identical to the amino acid sequence ofany of SEQ ID Nos: 1-33. Optionally, the fusion polypeptide may comprisea linker polypeptide positioned between the amino acid sequence of thefollistatin-related polypeptide and the amino acid sequence of themember of the asymmetric interaction pair. In certain embodiments, themember of the asymmetric interaction pair comprises a constant domain ofan immunoglobulin such as an Fc domain of an immunoglobulin. Forexample, follistatin-related polypeptide fusion proteins of thedisclosure may comprise an asymmetric interaction pair that comprises anamino acid sequence that is derived from an Fc domain of an IgG1, IgG2,IgG3 or IgG4. In some embodiments, the asymmetric interaction pair ofthe fusion protein comprises, consists essentially of, or consists of anamino acid sequence that is at least 80%, 85%, 90%, 95%, 97%, 98%, 99%or 100° A identical to any one of SEQ ID NOs: 34-46. In someembodiments, the member of the asymmetric interaction pair of the fusionprotein comprises, consists essentially of, or consists of an amino acidsequence that is at least 80%, 85%, 90%, 95%, 97%, 98%, 99% or 100° Aidentical to any one of SEQ ID NOs: 39-46.

Preferably protein complexes of the disclosure bind to one or moreligands selected from the group consisting of: GDF8, GDF-11, activin A,activin B, activin C, or activin E. Optionally, protein complexes of thedisclosure bind to one or more of these ligands with a K_(D) of greaterthan or equal to 10⁻⁸, 10⁻⁹, 10⁻¹⁰, 10⁻¹¹, or 10⁻¹². In someembodiments, protein complexes of the disclosure may inhibit signaling(e.g., signaling by SMADs 1, 2, 3, 5, and/or 8) by one or more ligandsselected from the group consisting of: GDF8, GDF-11, activin A, activinB, activin C, or activin E. Optionally, protein complexes of thedisclosure may inhibit signaling (e.g., signaling by SMADs 1, 2, 3, 5,and/or 8) by one or more of these ligands as measured in a cell-basedassay.

Optionally, protein complexes of the disclosure exhibit improvedpurification compared to native, monomeric follistatin-related peptides.Optionally protein complexes of the disclosure exhibit a serum half-lifeof at least 4, 6, 12, 24, 36, 48, or 72 hours in a mammal (e.g., a mouseor a human). Optionally, protein complexes of the disclosure may exhibita serum half-life of at least 6, 8, 10, 12, 14, 20, 25, or 30 days in amammal (e.g., a mouse or a human).

In certain aspects the disclosure provides nucleic acids encoding any ofthe first and/or second polypeptides disclosed herein. Nucleic acidsdisclosed herein may be operably linked to a promoter for expression,and the disclosure further provides cells transformed with suchrecombinant polynucleotides. Preferably the cell is a mammalian cellsuch as a COS cell or a CHO cell.

In certain aspects, the disclosure provides methods for making any ofthe first and second polypeptides disclosed herein as well as proteincomplexes comprising such a first and second polypeptide of thedisclosure. Such a method may include expressing any of the nucleicacids disclosed herein in a suitable cell (e.g., CHO cell or a COScell). Such a method may comprise: a) culturing a cell under conditionssuitable for expression of the first and/or second polypeptide of thedisclosure, wherein said cell is transformed with a first and/or secondpolypeptide expression construct; and b) recovering the first and/orsecond polypeptide so expressed. Similarly, a method may comprise: a)culturing a cell under conditions suitable for expression of the firstand second polypeptide of the disclosure, wherein said cell istransformed with a first and second polypeptide expression construct;and b) recovering the protein complex of the disclosure so expressed.First and/or second polypeptides described herein, as well as proteincomplex comprising first and second polypeptides of the disclosure, maybe recovered as crude, partially purified, or highly purified fractionsusing any of the well-known techniques for obtaining protein from cellcultures.

Any of the protein complexes described herein may be incorporated into apharmaceutical preparation. Optionally, such pharmaceutical preparationsare at least 80%, 85%, 90%, 95%, 97%, 98% or 99% pure with respect toother polypeptide components. In some embodiments, pharmaceuticalpreparation described herein comprises less than 20%, 15%, 10%, 5%, 3%,2%, or 1% of homodimers formed by the self-association of the first orsecond polypeptides. Optionally, pharmaceutical preparations disclosedherein may comprise one or more additional active agents.

In certain aspects, compositions of the present disclosure, includingfor example various protein complexes comprising follistatin-relatedfusion polypeptides disclosed herein, can be used for treating orpreventing a disease or condition that is associated with abnormalactivity of a follistatin-related fusion polypeptide and/or afollistatin ligand (e.g., myostatin, activins, GDF11). These diseases,disorders or conditions are generally referred to herein as“follistatin-associated conditions.” In certain embodiments, the presentdisclosure provides methods of treating or preventing an individual inneed thereof through administering to the individual a therapeuticallyeffective amount of a protein complex described herein. These methodsare particularly aimed at therapeutic and prophylactic treatments ofanimals, and more particularly, humans.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows multiple sequence alignment of Fc domains from human IgGisotypes using Clustal 2.1. Hinge regions are indicated by dottedunderline. Double underline indicates examples of positions that may beengineered in IgG1 Fc (SEQ ID NO: 34) to promote asymmetric chainpairing and the corresponding positions with respect to other isotypes(SEQ ID NOs: 35, 36, 38).

FIG. 2 shows a schematic example of a heteromeric protein complexcomprising a follistatin-related polypeptide for therapeutic use. The“follistatin-related polypeptide” (A) may be positioned C-terminal to,or N-terminal to, the “first member of an interaction pair” (B). Alinker, as well as other amino acid sequences, may be positioned betweenthe follistatin-related polypeptide and the first member of aninteraction pair. The first and second members of the interaction pair(B, C) may be an asymmetric pair, meaning that the members of the pairpreferentially associate with each other rather than self-associate, orthe interaction pair may be unguided, meaning that the members of thepair may associate with each other or self-associate without substantialpreference, and may have the same or different amino acid sequences.Traditional Fc fusion proteins and antibodies are examples of unguidedinteraction pairs, whereas a variety of engineered Fc domains have beendesigned as asymmetric interaction pairs. In the second polypeptide,additional amino acids may be positioned C-terminal or N-terminal to thesecond member of the interaction pair, and such amino acids may or maynot confer a biological activity but should be heterologous to thefollistatin-related polypeptide and preferably are not afollistatin-related polypeptide.

DETAILED DESCRIPTION OF THE INVENTION 1. Overview A. Regulation ofTissue Homeostasis by TGFβ Superfamily Ligands

TGFβ superfamily signaling pathways are critical for prenatal andpostnatal regulation of diverse cell types and tissues, includingmuscle, bone, adipose tissue, pancreatic function, hematopoietic cells,and others. Protein complexes described herein may bind to one or moreligands of the TGFβ superfamily, including for example a member of theactivin group, the Growth and Differentiation Factor (GDF) group, theBone Morphogenetic Protein (BMP) group or one or more other members ofthe superfamily. The superfamily ligand myostatin, encoded by the MSTNgene and also known as growth differentiation factor-8 (GDF8), is widelyrecognized as an endogenous inhibitor of skeletal muscle mass. Micehomozygous for a deletion of Mstn display robust increases in skeletalmuscle mass due to a combination of increased fiber number and musclefiber hypertrophy [McPherron et al. (1997) Nature 387:83-90]. Selectivepostnatal loss of myostatin signaling causes significant muscle fiberhypertrophy, thereby indicating that myostatin is an important regulatorof muscle homeostasis in adults [Lee et al. (2010) Mol Endocrinol24:1998-2008]. Naturally occurring mutations of myostatin are associatedwith increased skeletal muscle mass in humans, cattle, sheep, and dogs.For example, the Piedmontese and Belgian Blue cattle breeds carry aloss-of-function mutation in MSTN that causes a marked increase inmuscle mass [Grobet et al. (1997) Nat Genet 17:71-74]. In humans,inactive alleles of MSTN are associated with increased muscle mass and,reportedly, exceptional strength [Schuelke et al (2004) N Engl J Med2004, 350:2682-8.] Conversely, muscle wasting in humans associated withinfection by human immunodeficiency virus is accompanied by increasedMSTN expression [Gonzalez-Cadavid et al. (1998) Proc Natl Acad Sci USA95:14938-14943].

Inhibition of myostatin activity may be an effective strategy forincreasing muscle mass and strength in patients with inherited andacquired clinical conditions associated with debilitating muscle loss[Lee (2004) Annu Rev Cell Dev Biol 20:61-86; Tsuchida (2008) Curr OpinDrug Discov Dev 11:487-494; Rodino-Klapac et al. (2009) Muscle Nerve39:283-296]. Studies with mouse models of muscle disease have suggestedthat loss of myostatin signaling has beneficial effects in a wide rangeof disease settings, including muscular dystrophy, spinal muscularatrophy, cachexia, steroid-induced myopathy, and age-related sarcopenia.Moreover, loss of myostatin signaling has been shown to decrease fataccumulation and improve glucose metabolism in models of metabolicdiseases, raising the possibility that targeting myostatin may also haveapplications for diseases such as obesity and type 2 diabetes. Thusthere is considerable interest in identifying methods for therapeuticinhibition of myostatin signaling in vivo.

Like other superfamily ligands, myostatin is synthesized as a precursorconsisting of a signal peptide, an N-terminal prodomain, and aC-terminal mature domain. During synthesis, the myostatin prodomaininteracts noncovalently with mature myostatin to maintain thesemolecules in a conformation that facilitates dimerization [Harrison etal (2011) Growth Factors 29:174-186]. After cleavage of the dimericprecursor, the twin prodomains initially remain attached to the matureprotein, forming a latent complex [(Miyazono et al (1988) J Biol Chem263:6407-6415; Wakefield et al (1988) J Biol Chem 263:7646-7654; Brownet al (1990) Growth Factors 3:35-43]. To a greater degree than most TGFβligands, secreted myostatin initially exists in a latent or semilatentform whose activity can then be unmasked by other factors [Wolfman et al(2003) Proc Natl Acad Sci USA 100:15842-15846; Szlama et al (2013) FEBSJ 280:3822-3839]. Latent myostatin resides in the extracellular space,where the prodomain interacts with matrix proteins to regulate myostatinbioavailability [Anderson et al (2008) J Biol Chem 283:7027-7035; Sengleet al (2011) J Biol Chem 286:5087-5099], or enters the circulation,where the majority of myostatin exists in this latent form [Hill et al(2002) J Biol Chem 277:40735-40741]. In the extracellular compartment,interaction of the myostatin prodomain with the proteoglycan perlecanincreases concentrations of the latent complex near target cells. Latentmyostatin is converted to an active form by site-specific cleavage ofone or both associated prodomains by metalloproteases located in theextracellular matrix [Wolfman et al (2003) Proc Natl Acad Sci USA100:15842-15846].

Once released from its prodomain, myostatin exerts its cellular effectsby inducing formation of ternary complexes incorporating an activin typeII receptor (ActRIIA or ActRIIB) and an activin type I receptor(generally ALK4 or ALK7). These activated receptor complexes in turnphosphorylate Smad proteins (Smad2 and Smad3, Smad2/3), which enablesthe Smad proteins to form a transcriptional complex with Smad4 thatregulates expression of specific target genes [see, e.g., Mathews andVale (1991) Cell 65:973-982; Attisano et al. (1992) Cell 68: 97-108;Massague (2000) Nat. Rev. Mol. Cell Biol. 1:169-178]. Type I and type IIreceptors are transmembrane proteins, composed of a ligand-bindingextracellular domain with cysteine-rich region, a transmembrane domain,and a cytoplasmic domain with predicted serine/threonine specificity.Type II receptors are required for binding ligands, while type Ireceptors are essential for signaling.

Although myostatin is the best established case, one or more additionalligands that signal through ActRIIA and/or ActRIIB have also beenimplicated as endogenous inhibitors of muscle hypertrophy [Lee et al(2005) Proc Natl Acad Sci USA 102:18117-18122]. Activins are a family ofdimeric ligands within the TGFβ superfamily and are composed ofinhibin-β subunits. Specifically, activins include the homodimeric formsactivin A (β_(A)β_(A)), activin B (β_(A)β_(B)), activin C (β_(C)β_(C)),and activin E (β_(E)β_(E)), as well as heterodimeric forms, includingactivin AB (β_(A)β_(B)) and heterodimers containing β_(C) or β_(E). Inaddition, the structurally related heterodimer inhibin is an importantinhibitory regulator of activin signaling in various tissues.

Activins play diverse physiologic and pathologic roles. Multiple linesof evidence implicate activins as functioning in concert with myostatinto limit muscle mass, and activin antagonists can promote muscle growthor counteract muscle loss in vivo. [Link et al (1997) Exp Cell Res233:350-362; He et al (2005) Anat Embryol (Berl) 209:401-407; Souza etal (2008) Mol Endocrinol 22:2689-2702; Gilson et al (2009) Am J PhysiolEndocrinol Metab 297:E157-E164; Lee et al (2010) Mol Endocrinol24:1998-2008; Zhou et al. (2010) Cell 142:531-43]. Activins play a majorrole in bone homeostasis and are implicated as regulators oferythropoiesis [Maguer-Satta et al (2003) Exp Cell Res 282:110-120;Fields et al (2013) Expert Opin Investig Drugs 22:87-101]. Recentstudies have pointed to roles of activins in wound healing,angiogenesis, inflammation, immunity, fibrosis, and cancer [Antsiferovaet al (2012) J Cell Sci 125:3929-3937]. The activin/inhibin signalingpathway is associated with cancer of the ovaries, testes, and adrenalglands. In addition, activin A is a major inhibitory regulator ofhepatocyte proliferation, and dysregulated activin signaling has beenimplicated in hepatic diseases including inflammation, fibrosis, liverfailure, and cancer. [Kreidl et al (2009) World J Hepatol 1:17-27].Other functions of activins include induction of mesodermaldifferentiation, modulation of the cell cycle, support of neuronal cellsurvival, and coordination of endocrine cell activity [DePaolo et al.(1991) Proc Soc Exp Biol Med 198:500-512; Dyson et al (1997) Curr Biol7:81-84; Woodruff (1998) Biochem Pharmacol 55:953-963].

As described herein, agents that bind to “activin A” are agents thatspecifically bind to the β_(A) subunit, whether in the context of anisolated β_(A) subunit or as a dimeric complex (e.g., a β_(A)β_(A)homodimer or a β_(A)β_(B) heterodimer). In the case of a heterodimercomplex (e.g., a β_(A)β_(B) heterodimer), agents that bind to “activinA” are specific for epitopes present within the β_(A) subunit, but donot bind to epitopes present within the non-β_(A) subunit of the complex(e.g., the β_(B) subunit of the complex). Similarly, agents disclosedherein that antagonize (inhibit) “activin A” are agents that inhibit oneor more activities as mediated by a β_(A) subunit, whether in thecontext of an isolated β_(A) subunit or as a dimeric complex (e.g., aβ_(A)β_(A) homodimer or a β_(A)β_(B) heterodimer). In the case ofβ_(A)β_(B) heterodimers, agents that inhibit “activin A” are agents thatspecifically inhibit one or more activities of the β_(A) subunit, but donot inhibit the activity of the non-β_(A) subunit of the complex (e.g.,the β_(B) subunit of the complex). This principle applies also to agentsthat bind to and/or inhibit “activin B”, “activin C”, and “activin E”.Agents disclosed herein that antagonize “activin AB” are agents thatinhibit one or more activities as mediated by the β_(A) subunit and oneor more activities as mediated by the β_(B) subunit.

Growth differentiation factor-11 (GDF11), also known as bonemorphogenetic protein-11 (BMP11), is expressed in the tail bud, limbbud, maxillary and mandibular arches, and dorsal root ganglia duringmouse development [see, e.g., Nakashima et al. (1999) Mech. Dev. 80:185-189]. GDF11 plays a unique role in patterning both mesodermal andneural tissues [see, e.g., Gamer et al. (1999) Dev Biol., 208:222-32]and is a negative regulator of chondrogenesis and myogenesis indeveloping chick limb [see, e.g., Gamer et al. (2001) Dev Biol.229:407-20]. Expression of GDF11 in brain suggests that GDF11 may alsoregulate neural activity, and GDF11 inhibits neurogenesis in theolfactory epithelium [see, e.g., Wu et al. (2003) Neuron. 37:197-207].Recent studies implicate GDF11 as an important regulatory signal inerythropoiesis, particularly during late-stage erythroid differentiation[Suragani et al (2014) Nat Med 20:408-414]. In addition, GDF11 has beeninvestigated as a potential inhibitor of muscle hypertrophy based onstructural similarity to myostatin, shared signaling components, andGDF11 expression in skeletal muscle [McPherron et al. (1999) Nat. Genet.22: 260-264]. Although GDF11, like myostatin, is detectable in thegeneral circulation in a latent complex with its prodomain and inhibitsmuscle cell differentiation ex vivo [Souza et al (2008) Mol Endocrinol22:2689-2702], genetic studies have not revealed a role for GDF11 inregulating muscle size, fiber number, or fiber type, even undercondtions of myostatin deficiency [McPherron et al (2009) BMC Devel Biol9:24]. Thus, it remains to be firmly determined whether GDF11contributes to the regulation of muscle mass.

B. Extracellular Inhibitors of TGFβ Superfamily Ligands

In addition to the ligand-associated prodomains discussed above, severalother native proteins inhibit TGFβ superfamily ligands extracellularlyand thereby regulate the activity of these ligands in critical ways. Inhumans, soluble endogenous inhibitors of myostatin, activins, and GDF11include multiple follistatin isoforms, the product of thefollistatin-like gene (FSTL3) known as FLRG, and a pair of closelyrelated proteins named WFIKKN1 and WFIKKN2 based on their shared domainstructure which includes a whey acidic protein domain (W), afollistatin-Kazal domain (F), an immunoglobulin domain (I), two tandemdomains related to Kunitz-type protease inhibitor modules (KK), and anetrin domain (N). Follistatin, FLRG, WFIKKN1, and WFIKKN2 polypeptideseach contain one or more structural motifs generally referred to as“follistatin domains” which are important for selective binding to TGFβsuperfamily ligands. Therefore, as disclosed herein, the term“follistatin-related polypeptides” includes, for example, nativefollistatin, FLRG, WFIKKN1, and WFIKKN2 sequences, as well as variantsand truncations thereof.

Best studied among extracellular inhibitors of myostatin, activins,and/or GDF11 is follistatin, a single gene (FST) from which aregenerated multiple isoforms. Follistatin is an autocrine glycoproteinexpressed in nearly all tissues of higher animals. It was initiallyisolated from follicular fluid and was identified as a protein fractionthat inhibited follicle-stimulating hormone (FSH) secretion from theanterior pituitary [Esch et al. (1987) Mol Endocrinol 1:849-855]. Theimportance of follistatin in TGFβ superfamily signaling is illustratedby the multiple defects and perinatal death observed infollistatin-deficient mice [Matzuk et al (1995) Nature 374:360-363].Postnatally, follistatin promotes muscle growth by inhibiting myostatinand activins [Lee et al (2010) Mol Endocrinol 24:1998-2008] andpotentially GDF11. The biologic activity of follistatin stems from itsability to bind these ligands with high affinity and thereby preventinteraction of the ligand with its cell-surface receptor ActRIIA orActRIIB [Nakamura et al (1990) Science 247:836-838; Kogawa et al (1991)Endocrinology 128:1434-1440; Schneyer et al (1994) Endocrinology135:667-674; de Winter et al (1996) Mol Cell Endocrinol 116:105-114;Thompson et al (2005) Dev Cell 9:535-543]. In addition, follistatincontains a heparin-binding domain that in some isoforms facilitatesfollistatin interaction with proteoglycans at the cell surface [Inouyeet al (1992) Mol Cell Endocrinol 90:1-6], thereby maintaining higherconcentrations of follistatin near the sites of ligand action.Furthermore, binding of follistatin (FST288) to myostatin substantiallyincreases the affinity of follistatin for heparin [Cash et al (2009)EMBO J 28:2662-2676], thereby suggesting that ligand binding promotescell-surface localization of the follistatin-myostatin complex.

Follistatin contains three repeats of a distinctive structural motifknown as a “follistatin domain”, which encompasses a conserved linearpattern of ten cysteines and forms a characteristic arrangement ofintramolecular disulfide bonds [Esch et al. (1987) Mol Endocrinol1:849-855]. A follistatin domain is defined herein as an amino aciddomain, or a nucleic acid sequence encoding an amino acid domain,characterized by cysteine-rich repeats. A follistatin domain typicallyencompasses a span of 65-90 amino acids and contains ten conservedcysteine residues and a region similar to Kazal serine proteaseinhibitor domains. Thus, follistatin domains are sometimes referred toas “follistatin/Kazal domains” or “follistatin/Kazal-like domains”. Ingeneral, the loop regions between the cysteine residues exhibit sequencevariability, but some conservation is present. The loop between thefourth and fifth cysteines is usually short, containing only one or twoamino acids. The amino acids in the loop between the seventh and eighthcysteines are generally the most highly conserved, containing aconsensus sequence of (G,A)-(S,N)-(S,N,T)-(D,N)-(G,N) followed by a(T,S)-Y motif. The region between the ninth and tenth cysteinesgenerally contains a motif incorporating two hydrophobic residues(specifically V, I, or L) separated by another amino acid.

The term “follistatin polypeptide” is used to refer to polypeptidescomprising any naturally occurring polypeptide of the follistatin familyas well as any variants thereof (including mutants, fragments, fusions,and peptidomimetic forms) that retain a useful activity, including, forexample, ligand binding (e.g., myostatin, GDF11, activin A, activin B).For example, follistatin polypeptides include polypeptides comprising anamino acid sequence derived from the sequence of any known follistatinhaving a sequence at least about 80% identical to the sequence of afollistatin polypeptide (SEQ ID NOs: 1-17), and preferably at least 85%,90%, 95%, 97%, 99% or greater identity to any of SEQ ID NOs: 1-17. Theterm “follistatin fusion polypeptide” may refer to fusion proteins thatcomprise any of the polypeptides mentioned above along with aheterologous (non-follistatin) portion. An amino acid sequence isunderstood to be heterologous to follistatin if it is not uniquely foundin the long (315 amino acid) form of human follistatin, represented bySEQ ID NO:3. Many examples of heterologous portions are provided herein,and such heterologous portions may be immediately adjacent, by aminoacid sequence, to the follistatin polypeptide portion of a fusionprotein, or separated by intervening amino acid sequence, such as alinker or other sequence. In addition, methods for making and testinglibraries of polypeptides are described herein and such methods alsopertain to making and testing variants of follistatin.

Follistatin is a single-chain polypeptide with a range of molecularweights from 31 to 49 kDa based on alternative mRNA splicing andvariable glycosylation of the protein. Alternatively spliced mRNAs fromthe follistatin gene encode isoforms of 288 amino acids (i.e., FST288)and 315 amino acids (i.e., FST315), and the latter can be processedproteolytically to yield yet another isoform, follistatin 303 (FST303).Analysis of the amino acid sequence of native human follistatinpolypeptide has revealed that it comprises five domains: a signalsequence (amino acids 1-29 of SEQ ID NO:1), an N-terminal domain(FST_(ND)) (amino acids 30-94 of SEQ ID NO:1), follistatin domain-1(FST_(FD1)) (amino acids 95-164 of SEQ ID NO:1), follistatin domain-2(FST_(FD2)) (amino acids (168-239 of SEQ ID NO:1), and follistatindomain-3 (FST_(FD3)) (amino acids 245-316 of SEQ ID NO:1). See Shimanskiet al (1988) Proc Natl Acad Sci USA 85:4218-4222.

The human follistatin-288 (FST288) precursor has the following aminoacid sequence (SEQ ID NO: 1) (NCBI Reference Sequence NP_006341; UniprotP19883-2), with the signal peptide indicated by dotted underline, theN-terminal domain (FST_(ND)) indicated by dashed underline, and thefollistatin domains 1-3 (FST_(FD1), FST_(FD2), FST_(FD3)) indicated bysolid underline.

(SEQ ID NO: 1)   1

 51

101 GPGKKCRMNK KNKPRCVCAP DCSNITWKGP VCGLDGKTYR NECALLKARC 151KEQPELEVQY QGRCKKTCRD VFCPGSSTCV VDQTNNAYCV TCNRICPEPA 201SSEQYLCGND GVTYSSACHL RKATCLLGRS IGLAYEGKCI KAKSCEDIQC 251TGGKKCLWDF KVGRGRCSLC DELCPDSKSD EPVCASDNAT YASECAMKEA 301ACSSGVLLEV KHSGSCN

The mature (processed) human follistatin variant FST288 has thefollowing amino acid sequence (SEQ ID NO: 2) with the N-terminal domainindicated by dashed underline and the follistatin domains 1-3 indicatedby solid underline. Moreover, it will be appreciated that any of theinitial amino acids G or N, prior to the first cysteine may be removedby processing or intentionally eliminated without any consequence, andpolypeptides comprising such slightly smaller polypeptides are furtherincluded.

(SEQ ID NO: 2)   1

 51

101 PVCGLDGKTY RNECALLKAR CKEQPELEVQ YQGRCKKTCR DVFCPGSSTC 151VVDQTNNAYC VTCNRICPEP ASSEQYLCGN DGVTYSSACH LRKATCLLGR 201SIGLAYEGKC IKAKSCEDIQ CTGGKKCLWD FKVGRGRCSL CDELCPDSKS 251DEPVCASDNA TYASECAMKE AACSSGVLLE VKHSGSCN

The human follistatin-315 (FST315) precursor has the following aminoacid sequence (SEQ ID NO: 3) (NCBI Reference Sequence NP_037541.1;Uniprot P19883), with the signal peptide indicated by dotted underline,the N-terminal domain (FST_(ND)) indicated by dashed underline, and thefollistatin domains 1-3 (FST_(FD1), FST_(FD2), FST_(FD3)) indicated bysolid underline.

(SEQ ID NO: 3)   1

 51

101 GPGKKCRMNK KNKPRCVCAP DCSNITWKGP VCGLDGKTYR NECALLKARC 151KEQPELEVQY QGRCKKTCRD VFCPGSSTCV VDQTNNAYCV TCNRICPEPA 201SSEQYLCGND GVTYSSACHL RKATCLLGRS IGLAYEGKCI KAKSCEDIQC 251TGGKKCLWDF KVGRGRCSLC DELCPDSKSD EPVCASDNAT YASECAMKEA 301ACSSGVLLEV KHSGSCNSIS EDTEEEEEDE DQDYSFPISS ILEW

Mature (processed) human FST315 has the following amino acid sequence(SEQ ID NO: 4) with the N-terminal domain indicated by dashed underlineand the follistatin domains 1-3 indicated by solid underline. Moreover,it will be appreciated that any of the initial amino acids G or N, priorto the first cysteine may be removed by processing or intentionallyeliminated without any consequence, and polypeptides comprising suchslightly shorter polypeptides are further included.

(SEQ ID NO: 4)   1

 51

101 PVCGLDGKTY RNECALLKAR CKEQPELEVQ YQGRCKKTCR DVFCPGSSTC 151VVDQTNNAYC VTCNRICPEP ASSEQYLCGN DGVTYSSACH LRKATCLLGR 201SIGLAYEGKC IKAKSCEDIQ CTGGKKCLWD FKVGRGRCSL CDELCPDSKS 251DEPVCASDNA TYASECAMKE AACSSGVLLE VKHSGSCNSI SEDTEEEEED 301EDQDYSFPIS SILEW

Follistatin-related polypeptides of the disclosure may include anynaturally occurring domain of a follistatin protein as well as variantsthereof (e.g., mutants, fragments, and peptidomimetic forms) that retaina useful activity. For example, it is well-known that FST315 and FST288have high affinity for myostatin, activins (activin A and activin B),and GDF11 and that the follistatin domains (e.g., FST_(ND), FST_(FD1),FST_(FD2), and FST_(FD3)) are thought to be involved in the binding ofsuch TGFβ ligands. However, there is evidence that each of these fourdomains has a different affinity for these TGF-β ligands. For example, arecent study has demonstrated that polypeptide constructs comprisingonly the N-terminal domain and two FST_(FD1) domains in tandem retainedhigh affinity for myostatin, demonstrated little or no affinity foractivins, and promoted systemic muscle growth when introduced into amouse by gene expression [Nakatani et al (2008) FASEB 22:478-487].Accordingly, the present disclosure encompasses, in part, variantfollistatin proteins that demonstrate selective binding and/orinhibition of a given TGFβ ligand relative to the naturally occurringFST protein (e.g., maintaining high-affinity for myostain while having asignificantly reduced affinity for activin).

In certain aspects, the disclosure includes polypeptides comprising theFST_(ND) domain, as set forth below (SEQ ID NO: 5), and, for example,one or more heterologous polypeptides, and moreover, it will beappreciated that any of the initial amino acids G or N, prior to thefirst cysteine may be deleted, as in the example shown below (SEQ ID NO:6).

(SEQ ID NO: 5) 1 GNCWLRQAKN GRCQVLYKTE LSKEECCSTG RLSTSWTEED VNDNTLFKWM51 IFNGGAPNCI PCKET (SEQ ID NO: 6) 1 CWLRQAKNGR CQVLYKTELS KEECCSTGRLSTSWTEEDVN DNTLFKWMIF 51 NGGAPNCIPC KET

In certain aspects, the disclosure includes polypeptides comprising theFST_(FD1) domain (SEQ ID NO: 7) which contains the minimal coreactivities of myostatin (and/or GDF11) binding along with heparinbinding as set forth below, and, for example, one or more heterologouspolypeptides.

(SEQ ID NO: 7) 1 CENVDCGPGK KCRMNKKNKP RCVCAPDCSN ITWKGPVCGL DGKTYRNECA51 LLKARCKEQP ELEVQYQGRC

In certain aspects, the disclosure includes polypeptides comprising theFST_(FD2) domain (SEQ ID NO: 8) and/or the FST_(FD3) domain (SEQ ID NO:9) as set forth below, and, for example, one or more heterologouspolypeptides.

(SEQ ID NO: 8) 1 CRDVFCPGSS TCVVDQTNNA YCVTCNRICP EPASSEQYLC GNDGVTYSSA51 CHLRKATCLL GRSIGLAYEG KC (SEQ ID NO: 9) 1 CEDIQCTGGK KCLWDFKVGRGRCSLCDELC PDSKSDEPVC ASDNATYASE 51 CAMKEAACSS GVLLEVKHSG SC

An FST_(FD1) sequence may be advantageously maintained in structuralcontext by expression as a polypeptide further comprising the FST_(ND)domain. Accordingly, the disclosure includes polypeptides comprising theFST_(ND)-FST_(FD1) sequence, as set forth below (SEQ ID NO:10), and, forexample, one or more heterologous polypeptides, and moreover, it will beappreciated that any of the initial amino acids G or N, prior to thefirst cysteine may be removed by processing or intentionally eliminatedwithout any consequence, and polypeptides comprising such slightlyshorter polypeptides are further included.

(SEQ ID NO: 10) 1 CWLRQAKNGR CQVLYKTELS KEECCSTGRL STSWTEEDVN DNTLFKWMIF51 NGGAPNCIPC KETCENVDCG PGKKCRMNKK NKPRCVCAPD CSNITWKGPV 101 CGLDGKTYRNECALLKARCK EQPELEVQYQ GRC

As demonstrated by Nakatani et al., a FST_(ND)-FST_(FD1)-FST_(FD1)construct is sufficient to confer systemic muscle growth whengenetically expressed in a mouse, and accordingly the disclosureincludes polypeptides comprising the amino acid sequence below (SEQ IDNO: 11) and, for example, one or more heterologous polypeptides.

(SEQ ID NO: 11) 1 CWLRQAKNGR CQVLYKTELS KEECCSTGRL STSWTEEDVN DNTLFKWMIF51 NGGAPNCIPC KETCENVDCG PGKKCRMNKK NKPRCVCAPD CSNITWKGPV 101 CGLDGKTYRNECALLKARCK EQPELEVQYQ GRCCENVDCG PGKKCRMNKK 151 NKPRCVCAPD CSNITWKGPVCGLDGKTYRN ECALLKARCK EQPELEVQYQ 201 GRC

While the FST_(FD1) sequence confers myostatin and GDF11 binding, it hasbeen demonstrated that activins, particularly activin A but also activinB, are also negative regulators of muscle, and therefore a follistatinpolypeptide that inhibits both the myostatin/GDF11 ligand group and theactivin A/activin B ligand group may provide a more potent muscleeffect. Given that FST_(FD2) confers activin A and B binding, thedisclosure provides polypeptides comprising FST_(Fm)-FST_(FD2) (SEQ IDNO: 12) and FST_(Fm)-FST_(FD2)-FST_(FD3) (SEQ ID NO: 13), as well asconstructs comprising FST_(ND)-FST_(Fm)-FST_(FD2) (SEQ ID NO: 14) and,for example, one or more heterologous polypeptides.

(SEQ ID NO: 12) 1 CENVDCGPGK KCRMNKKNKP RCVCAPDCSN ITWKGPVCGL DGKTYRNECA51 LLKARCKEQP ELEVQYQGRC CRDVFCPGSS TCVVDQTNNA YCVTCNRICP 101 EPASSEQYLCGNDGVTYSSA CHLRKATCLL GRSIGLAYEG KC (SEQ ID NO: 13) 1 CENVDCGPGKKCRMNKKNKP RCVCAPDCSN ITWKGPVCGL DGKTYRNECA 51 LLKARCKEQP ELEVQYQGRCCRDVFCPGSS TCVVDQTNNA YCVTCNRICP 101 EPASSEQYLC GNDGVTYSSA CHLRKATCLLGRSIGLAYEG KCCEDIQCTG 151 GKKCLWDFKV GRGRCSLCDE LCPDSKSDEP VCASDNATYASECAMKEAAC 201 SSGVLLEVKH SGSC (SEQ ID NO: 14) 1 CWLRQAKNGR CQVLYKTELSKEECCSTGRL STSWTEEDVN DNTLFKWMIF 51 NGGAPNCIPC KETCENVDCG PGKKCRMNKKNKPRCVCAPD CSNITWKGPV 101 CGLDGKTYRN ECALLKARCK EQPELEVQYQ GRCCRDVFCPGSSTCVVDQT 151 NNAYCVTCNR ICPEPASSEQ YLCGNDGVTY SSACHLRKAT CLLGRSIGLA201 YEGKC

A follistatin polypeptide of 291 amino acids (representing a truncationof the naturally occurring FST315) may have advantageous properties incertain embodiments. Accordingly, unprocessed (SEQ ID NO: 15) and matureFST291 (SEQ ID NO: 16) polypeptides are included in the disclosure andmay be combined with heterologous proteins. Moreover, it will beappreciated that any of the initial amino acids G or N, prior to thefirst cysteine may be removed by processing or intentionally eliminatedwithout any consequence, and polypeptides comprising such slightlyshorter polypeptides are further included, such as the example shownbelow (SEQ ID NO: 17).

(SEQ ID NO: 15) 1 MVRARHQPGG LCLLLLLLCQ FMEDRSAQAG NCWLRQAKNG RCQVLYKTEL51 SKEECCSTGR LSTSWTEEDV NDNTLFKWMI FNGGAPNCIP CKETCENVDC 101 GPGKKCRMNKKNKPRCVCAP DCSNITWKGP VCGLDGKTYR NECALLKARC 151 KEQPELEVQY QGRCKKTCRDVFCPGSSTCV VDQTNNAYCV TCNRICPEPA 201 SSEQYLCGND GVTYSSACHL RKATCLLGRSIGLAYEGKCI KAKSCEDIQC 251 TGGKKCLWDF KVGRGRCSLC DELCPDSKSD EPVCASDNATYASECAMKEA 301 ACSSGVLLEV KHSGSCNSIS (SEQ ID NO: 16) 1 GNCWLRQAKNGRCQVLYKTE LSKEECCSTG RLSTSWTEED VNDNTLFKWM 51 IFNGGAPNCI PCKETCENVDCGPGKKCRMN KKNKPRCVCA PDCSNITWKG 101 PVCGLDGKTY RNECALLKAR CKEQPELEVQYQGRCKKTCR DVFCPGSSTC 151 VVDQTNNAYC VTCNRICPEP ASSEQYLCGN DGVTYSSACHLRKATCLLGR 201 SIGLAYEGKC IKAKSCEDIQ CTGGKKCLWD FKVGRGRCSL CDELCPDSKS251 DEPVCASDNA TYASECAMKE AACSSGVLLE VKHSGSCNSI S (SEQ ID NO: 17) 1CWLRQAKNGR CQVLYKTELS KEECCSTGRL STSWTEEDVN DNTLFKWMIF 51 NGGAPNCIPCKETCENVDCG PGKKCRMNKK NKPRCVCAPD CSNITWKGPV 101 CGLDGKTYRN ECALLKARCKEQPELEVQYQ GRCKKTCRDV FCPGSSTCVV 151 DQTNNAYCVT CNRICPEPAS SEQYLCGNDGVTYSSACHLR KATCLLGRSI 201 GLAYEGKCIK AKSCEDIQCT GGKKCLWDFK VGRGRCSLCDELCPDSKSDE 251 PVCASDNATY ASECAMKEAA CSSGVLLEVK HSGSCNSIS

Follistatin proteins herein may be referred to as FST. If followed by anumber, such as FST288, this indicates that the protein is the288-amino-acid isoform of follistatin. If presented as FST288-Fc, thisindicates that an Fc domain is fused to the C-terminus of FST288, whichmay or may not include an intervening linker. The Fc in this instancemay be any immunoglobulin Fc portion as that term is defined herein. Ifpresented as FST288-G1Fc, this indicates that the Fc portion of humanIgG1 is fused at the C-terminal of FST288. Unless indicated to thecontrary, a protein described with this nomenclature will represent ahuman follistatin protein.

Closely related to the native follistatin isoforms encoded by FSTN is anaturally occurring protein encoded by the FSTL3 gene and knownalternatively as follistatin-related gene (FLRG), follistatin-like 3(FSTL3), or follistatin-related protein (FSRP) [Schneyer et al (2001)Mol Cell Endocrinol 180:33-38]. Like follistatin, FLRG binds tomyostatin, activins, and GDF11 with high affinity and thereby inhibitstheir bioactivity in vivo [Sidis et al (2006) Endocrinology147:3586-3597]. Unlike follistatin, FLRG does not possess aheparin-binding sequence, cannot bind to cell-surface proteoglycans, andtherefore is a less potent inhibitor of activin than is FST288 in theimmediate vicinity of the cell surface. In contrast to follistatin, FLRGalso circulates in the blood bound to mature myostatin, and thusresembles myostatin propeptide in this regard [Hill et al (2002) J BiolChem 277:40735-40741. Unlike follistatin, FLRG deficiency in mice is notlethal, although it does cause a variety of metabolic phenotypes[Mukherjee et al (2007) Proc Natl Acad Sci USA 104:1348-1353].

The overall structure of FLRG closely resembles that of follistatin.Native human FLRG precursor is a single-chain polypeptide whichcomprises four domains: a signal sequence (amino acids 1-26 of SEQ IDNO: 18), an N-terminal domain (FLRG_(ND)) (amino acids 38-96 of SEQ IDNO: 18), and two follistatin domains referred to herein as FLRG_(FD1)(amino acids 99-167 of SEQ ID NO: 18) and FLRG_(FD2) (amino acids171-243 of SEQ ID NO: 18).

The term “FLRG polypeptide” is used to refer to polypeptides comprisingany naturally occurring polypeptide of FLRG as well as any variantsthereof (including mutants, fragments, fusions, and peptidomimeticforms) that retain a useful activity. In certain preferred embodiments,FLRG polypeptides of the disclosure bind to and/or inhibit activity ofmyostatin, GDF11, or activin, particularly activin A (e.g.,ligand-mediated activation of ActRIIA and/or ActRIIB Smad2/3 signaling).Variants of FLRG polypeptides that retain ligand binding properties canbe identified using routine methods to assay interactions between FLRGand ligands (see, e.g., U.S. Pat. No. 6,537,966). In addition, methodsfor making and testing libraries of polypeptides are described hereinand such methods also pertain to making and testing variants of FLRG.

For example, FLRG polypeptides include polypeptides comprising an aminoacid sequence derived from the sequence of any known FLRG having asequence at least about 80% identical to the sequence of a FLRGpolypeptide (for example, SEQ ID NOs: 18-25), and optionally at least85%, 90%, 95%, 97%, 99% or greater identity to any of SEQ ID NOs: 18-25.The term “FLRG fusion polypeptide” may refer to fusion proteins thatcomprise any of the polypeptides mentioned above along with aheterologous (non-FLRG) portion. An amino acid sequence is understood tobe heterologous to FLRG if it is not uniquely found in human FLRG,represented by SEQ ID NO: 18. Many examples of heterologous portions areprovided herein, and such heterologous portions may be immediatelyadjacent, by amino acid sequence, to the FLRG polypeptide portion of afusion protein, or separated by intervening amino acid sequence, such asa linker or other sequence.

The human FLRG precursor has the following amino acid sequence (SEQ IDNO: 18) (amino acids 1-263 of NCBI Reference Sequence NP_005851.1), withthe signal peptide indicated by dotted underline, the N-terminal domain(FLRG_(ND)) indicated by dashed underline, and the two follistatindomains (FST_(FD1), FST_(FD2)) indicated by solid underline.

(SEQ ID NO: 18)   1

 51

101 GVECGPGKAC RMLGGRPRCE CAPDCSGLPA RLQVCGSDGA TYRDECELRA 151ARCRGHPDLS VMYRGRCRKS CEHVVCPRPQ SCVVDQTGSA HCVVCRAAPC 201PVPSSPGQEL CGNNNVTYIS SCHMRQATCF LGRSIGVRHA GSCAGTPEEP 251PGGESAEEEE NFV

Mature (processed) human FLRG comprises the following amino acidsequence (SEQ ID NO: 19) (amino acids 38-263 of NCBI Reference SequenceNP_005851.1) with the N-terminal domain indicated by dashed underlineand the two follistatin domains indicated by solid underline. Moreover,it will be appreciated that any of the amino acids (positions 27-37 ofSEQ ID NO: 18) prior to the first cysteine (position 38 in SEQ ID NO:18) may be included without substantial consequence, and polypeptidescomprising such slightly longer polypeptides are included.

(SEQ ID NO: 19)   1

 51

101 DGATYRDECE LRAARCRGHP DLSVMYRGRC RKSCEHVVCP RPQSCVVDQT 151GSAHCVVCRA APCPVPSSPG QELCGNNNVT YISSCHMRQA TCFLGRSIGV 201RHAGSCAGTP EEPPGGESAE EEENFV

In certain aspects, the disclosure includes polypeptides comprising theFLRG_(ND) domain (SEQ ID NO: 20), which interacts differently withmyostatin compared with activin A [Cash et al (2012) J Biol Chem287:1043-1053], as set forth below, and, for example, one or moreheterologous polypeptides. Moreover, it will be appreciated that any ofthe initial amino acids G or N prior to the first cysteine may bedeleted, as in the example shown below (SEQ ID NO: 21).

(SEQ ID NO: 20) 1 GVCWLQQGQE ATCSLVLQTD VTRAECCASG NIDTAWSNLT HPGNKINLLG51 FLGLVHCLPC (SEQ ID NO: 21) 1 CWLQQGQEAT CSLVLQTDVT RAECCASGNIDTAWSNLTHP GNKINLLGFL 51 GLVHCLPC

In certain aspects, the disclosure includes polypeptides comprising theFLRG_(FD1) domain as set forth below (SEQ ID NO: 22), and, for example,one or more heterologous polypeptides.

(SEQ ID NO: 22) 1 CDGVECGPGK ACRMLGGRPR CECAPDCSGL PARLQVCGSD GATYRDECEL51 RAARCRGHPD LSVMYRGRC

In certain aspects, the disclosure includes polypeptides comprising theFST_(FD2) domain as set forth below (SEQ ID NO: 23), and, for example,one or more heterologous polypeptides.

(SEQ ID NO: 23) 1 CEHVVCPRPQ SCVVDQTGSA HCVVCRAAPC PVPSSPGQEL CGNNNVTYIS51 SCHMRQATCF LGRSIGVRHA GSC

A FLRG_(FD) sequence may be advantageously maintained in structuralcontext by expression as a polypeptide further comprising the FLRG_(ND)domain. Accordingly, the disclosure includes polypeptides comprising theFLRG_(ND)-FLRG_(FD1) sequence (SEQ ID NO: 24) and theFLRG_(ND)-FLRG_(FD1)-FLRG_(FD2) sequence (SEQ ID NO: 25), as set forthbelow, and, for example, one or more heterologous polypeptides.Moreover, it will be appreciated that any of the initial amino acids Gor N, prior to the first cysteine may be removed by processing orintentionally eliminated without any consequence, and polypeptidescomprising such slightly shorter polypeptides are further included.

(SEQ ID NO: 24) 1 GVCWLQQGQE ATCSLVLQTD VTRAECCASG NIDTAWSNLT HPGNKINLLG51 FLGLVHCLPC KCDGVECGPG KACRMLGGRP RCECAPDCSG LPARLQVCGS 101 DGATYRDECELRAARCRGHP DLSVMYRGRC (SEQ ID NO: 25) 1 GVCWLQQGQE ATCSLVLQTD VTRAECCASGNIDTAWSNLT HPGNKINLLG 51 FLGLVHCLPC KCDGVECGPG KACRMLGGRP RCECAPDCSGLPARLQVCGS 101 DGATYRDECE LRAARCRGHP DLSVMYRGRC CEHVVCPRPQ SCVVDQTGSA151 HCVVCRAAPC PVPSSPGQEL CGNNNVTYIS SCHMRQATCF LGRSIGVRHA 201 GSC

If presented as FLRG-Fc, this indicates that an Fc domain is fused tothe C-terminus of FLRG, which may or may not include an interveninglinker. The Fc in this instance may be any immunoglobulin Fc portion asthat term is defined herein. If presented as FLRG-G1Fc, this indicatesthat the Fc portion of IgG1 is fused at the C-terminus of FLRG. Unlessindicated to the contrary, a protein described with this nomenclaturewill represent a human FLRG protein.

In addition to FSTN and FSTL3, two other genes have been identifiedwhose protein products contain a follistatin domain motif and functionas extracellular inhibitors of myostatin and GDF11. In humans, theseclosely related genes are named WFIKKN1 and WFIKKN2 based on theirshared domain structure which includes a whey acidic protein domain, afollistatin-Kazal domain, an immunoglobulin domain, two tandem domainsrelated to Kunitz-type protease inhibitor modules, and a netrin domain[Trexler et al (2001) Proc Natl Acad Sci USA 98:3705-3709; Trexler et al(2002) Biol Chem 383:223-228]. WFIKKN2 is also known as WFIKKN-relatedprotein (WFIKKNRP), and murine counterparts of these proteins are namedGDF-associated serum protein-2 (Gasp2) and Gasp1, respectively, based ontheir ligand-binding ability [Hill et al (2003) Mol Endocrinol17:1144-1154]. Native WFIKKN1 (GASP2) and WFIKKN2 (GASP1) proteinspossess overlapping activity profiles that are nonetheless distinct fromeach other and from follistatin or FLRG. WFIKKNs bind with high affinityto myostatin, GDF11, and in some cases to myostatin propeptide, withbinding to mature ligand mediated primarily by the follistatin domain(WFIKKN1_(FD), WFIKKN2_(FD)) and propeptide binding mediated primarilyby the netrin domain [Hill et al (2003) Mol Endocrinol 17:1144-1154;Kondas et al (2008) J Biol Chem 283:23677-23684]. In contrast tofollistatin and FLRG, neither WFIKKN1 nor WFIKKN2 bind activin [Szlámaet al (2010) FEBS J 277:5040-5050]. WFIKKN proteins inhibit myostatinand GDF11 signaling by blocking their access to activin type IIreceptors [Lee et al (2013) Proc Natl Acad Sci USA 110:E3713-E3722]. Dueto the presence of several protease inhibitory modules in both WFIKKNs,it is likely that they also regulate the action of multiple types ofproteases. The tissue expression patterns of WFIKKN1 differ prenatallyand postnatally from that of WFIKKN2, thus supporting the view that thetwo proteins serve distinct roles [Trexler et al (2002) Biol Chem383:223-228].

Additional lines of evidence implicate WFIKKNs in the regulation ofskeletal muscle mass. Mice with homozygous deletion of WFIKKN1 orWFIKKN2 display phenotypes consistent with overactivity of myostatin andGDF11, including a reduction in muscle weight, a shift in fiber typefrom fast glycolytic type IIb fibers to fast oxidative type Ha fibers,and impaired muscle regeneration (Lee et al (2013) Proc Natl Acad SciUSA 110:E3713-E3722]. Conversely, broad overexpression of WFIKKN2 inmice leads mainly to a hypermuscular phenotype [Monestier et al (2012)BMC Genomics 13:541-551. Although both WFIKKN proteins bind tomyostatin, WFIKKIN1 and WFIKKN2 may interact differently with myostatinpropeptide and thus may differentially block the activation of ActRIIAor ActRIIB by semilatent myostatin, which is the native complex betweenmyotatin and a single myostatin propeptide chain [Szláma et al (2013)FEBS J 280:3822-3839]. Taken together, follistatin-related fusionproteins comprising a WFIKKN1 or WFIKKN2 polypeptide as disclosed hereinwould be predicted to increase skeletal muscle mass in vivo withoutcausing potentially undesirable effects associated with inhibition ofendogenous activins.

The term “WFIKKN1 polypeptide” is used to refer to polypeptidescomprising any naturally occurring polypeptide of WFIKKN1 as well as anyvariants thereof (including mutants, fragments, fusions, andpeptidomimetic forms) that retain a useful activity. In certainpreferred embodiments, WFIKKN1 polypeptides of the disclosure bind toand/or inhibit activity of myostatin, myostatin propeptide, complexesbetween myostatin and its propeptide, GDF11, and potentially activins(e.g., ligand-mediated activation of ActRIIA and/or ActRIIB Smad2/3signaling). Variants of WFIKKN1 polypeptides that retain ligand bindingproperties can be identified using routine methods to assay interactionsbetween WFIKKN1 and ligands [see, e.g., Kondas et al (2008) J Biol Chem283:23677-23684; Szláma et al (2013) FEBS J 280:3822-3839]. In addition,methods for making and testing libraries of polypeptides are describedherein and such methods also pertain to making and testing variants ofWFIKKN1.

For example, WFIKKN1 polypeptides include polypeptides comprising anamino acid sequence derived from the sequence of any known WFIKKN1polypeptide having a sequence at least about 80% identical to thesequence of a WFIKKN1 polypeptide (for example, SEQ ID NOs: 26-28), andoptionally at least 85%, 90%, 95%, 97%, 99% or greater identity to anyof SEQ ID NOs: 26-28. The term “WFIKKN1 fusion polypeptide” may refer tofusion proteins that comprise any of the polypeptides mentioned abovealong with a heterologous (non-WFIKKN1) portion. An amino acid sequenceis understood to be heterologous to WFIKKN1 if it is not uniquely foundin human WFIKKN1, represented by SEQ ID NO: 26. Many examples ofheterologous portions are provided herein, and such heterologousportions may be immediately adjacent, by amino acid sequence, to theWFIKKN1 polypeptide portion of a fusion protein, or separated byintervening amino acid sequence, such as a linker or other sequence.

The human WFIKKN1 precursor has the following amino acid sequence (SEQID NO: 26) (NCBI Ref Seq NP_444514.1), with the signal peptide indicatedby dotted underline and the follistatin domain (WFIKKN1_(FD)) indicatedby solid underline.

(SEQ ID NO: 26)   1

 51 RECSRDQDCA AAEKCCINVC GLHSCVAARF PGSPAAPTTA ASCEGFVCPQ 101QGSDCDIWDG QPVCRCRDRC EKEPSFTCAS DGLTYYNRCY MDAEACLRGL 151HLHIVPCKHV LSWPPSSPGP PETTARPTPG AAPVPPALYS SPSPQAVQVG 201GTASLHCDVS GRPPPAVTWE KQSHQRENLI MRPDQMYGNV VVTSIGQLVL 251YNARPEDAGL YTCTARNAAG LLRADFPLSV VQREPARDAA PSIPAPAECL 301PDVQACTGPT SPHLVLWHYD PQRGGCMTFP ARGCDGAARG FETYEACQQA 351CARGPGDACV LPAVQGPCRG WEPRWAYSPL LQQCHPFVYG GCEGNGNNFH 401SRESCEDACP VPRTPPCRAC RLRSKLALSL CRSDFAIVGR LTEVLEEPEA 451AGGIARVALE DVLKDDKMGL KFLGTKYLEV TLSGMDWACP CPNMTAGDGP 501LVIMGEVRDG VAVLDAGSYV RAASEKRVKK ILELLEKQAC ELLNRFQD

Mature (processed) human WFIKKN1 has the following amino acid sequence(SEQ ID NO: 27) with the follistatin domain indicated by solidunderline. Moreover, it will be appreciated that any of the 13 aminoacids prior to the first cysteine may be removed by processing orintentionally eliminated without substantial consequence, andpolypeptides comprising such slightly smaller polypeptides are furtherincluded.

(SEQ ID NO: 27) 1 AGLLPGLGSH PGVCPNQLSP NLWVDAQSTC ERECSRDQDC AAAEKCCINV51 CGLHSCVAAR FPGSPAAPTT AASCEGFVCP QQGSDCDIWD GQPVCRCRDR 101CEKEPSFTCA SDGLTYYNRC YMDAEACLRG LHLHIVPCKH VLSWPPSSPG 151 PPETTARPTPGAAPVPPALY SSPSPQAVQV GGTASLHCDV SGRPPPAVTW 201 EKQSHQRENL IMRPDQMYGNVVVTSIGQLV LYNARPEDAG LYTCTARNAA 251 GLLRADFPLS VVQREPARDA APSIPAPAECLPDVQACTGP TSPHLVLWHY 301 DPQRGGCMTF PARGCDGAAR GFETYEACQQ ACARGPGDACVLPAVQGPCR 351 GWEPRWAYSP LLQQCHPFVY GGCEGNGNNF HSRESCEDAC PVPRTPPCRA401 CRLRSKLALS LCRSDFAIVG RLTEVLEEPE AAGGIARVAL EDVLKDDKMG 451LKFLGTKYLE VTLSGMDWAC PCPNMTAGDG PLVIMGEVRD GVAVLDAGSY 501 VRAASEKRVKKILELLEKQA CELLNRFQD

In certain aspects, the disclosure includes polypeptides comprising theWFIKKN1_(FD) domain as set forth below (SEQ ID NO: 28), and, forexample, one or more heterologous polypeptides.

(SEQ ID NO: 28) 1 CEGFVCPQQG SDCDIWDGQP VCRCRDRCEK EPSFTCASDG LTYYNRCYMD51 AEACLRGLHL HIVPC

If presented as WFIKKN1-Fc, this indicates that an Fc portion is fusedto the C-terminus of WFIKKN1, which may or may not include anintervening linker. The Fc in this instance may be any immunoglobulin Fcportion as that term is defined herein. If presented as WFIKKN1-G1Fc,this indicates that the Fc portion of IgG1 is fused at the C-terminus ofWFIKKN1. Unless indicated to the contrary, a protein described with thisnomenclature will represent a human WFIKKN1 protein.

The term “WFIKKN2 polypeptide” includes polypeptides comprising anynaturally occurring polypeptide of WFIKKN2 as well as any variantsthereof (including mutants, fragments, fusions, and peptidomimeticforms) that retain a useful activity. In certain preferred embodiments,WFIKKN2 polypeptides of the disclosure bind to and/or inhibit activityof myostatin, myostatin propeptide, complexes between myostatin and itspropeptide, GDF11, and potentially activins (e.g., ligand-mediatedactivation of ActRIIA and/or ActRIIB Smad2/3 signaling). Variants ofWFIKKN2 polypeptides that retain ligand binding properties can beidentified using routine methods to assay interactions between WFIKKN2and ligands [see, e.g., Kondas et al (2008) J Biol Chem 283:23677-23684;Szláma et al (2013) FEBS J 280:3822-3839]. In addition, methods formaking and testing libraries of polypeptides are described herein andsuch methods also pertain to making and testing variants of WFIKKN2.

For example, WFIKKN2 polypeptides include polypeptides comprising anamino acid sequence derived from the sequence of any known WFIKKN2polypeptide having a sequence at least about 80% identical to thesequence of a WFIKKN2 polypeptide (for example, SEQ ID NOs: 29-33), andoptionally at least 85%, 90%, 95%, 97%, 99% or greater identity to anyof SEQ ID NOs: 29-33. The term “WFIKKN2 fusion polypeptide” may refer tofusion proteins that comprise any of the polypeptides mentioned abovealong with a heterologous (non-WFIKKN2) portion. An amino acid sequenceis understood to be heterologous to WFIKKN2 if it is not uniquely foundin human WFIKKN2, represented by SEQ ID NO: 29. Many examples ofheterologous portions are provided herein, and such heterologousportions may be immediately adjacent, by amino acid sequence, to theWFIKKN2 polypeptide portion of a fusion protein, or separated byintervening amino acid sequence, such as a linker or other sequence.

The human WFIKKN2 precursor has the following amino acid sequence (SEQID NO: 29) (NCBI Ref Seq NP_783165.1), with the signal peptide indicatedby dotted underline and the follistatin domain (WFIKKN2_(FD)) indicatedby solid underline.

(SEQ ID NO: 29)   1

 51 NPNLWVDAQS TCRRECETDQ ECETYEKCCP NVCGTKSCVA ARYMDVKGKK 101GPVGMPKEAT CDHFMCLQQG SECDIWDGQP VCKCKDRCEK EPSFTCASDG 151LTYYNRCYMD AEACSKGITL AVVTCRYHFT WPNTSPPPPE TTMHPTTASP 201ETPELDMAAP ALLNNPVHQS VTMGETVSFL CDVVGRPRPE ITWEKQLEDR 251ENVVMRPNHV RGNVVVTNIA QLVIYNAQLQ DAGIYTCTAR NVAGVLRADF 301PLSVVRGHQA AATSESSPNG TAFPAAECLK PPDSEDCGEE QTRWHFDAQA 351NNCLTFTFGH CHRNLNHFET YEACMLACMS GPLAACSLPA LQGPCKAYAP 401RWAYNSQTGQ CQSFVYGGCE GNGNNFESRE ACEESCPFPR GNQRCRACKP 451RQKLVTSFCR SDFVILGRVS ELTEEPDSGR ALVTVDEVLK DEKMGLKFLG 501QEPLEVTLLH VDWACPCPNV TVSEMPLIIM GEVDGGMAML RPDSFVGASS 551ARRVRKLREV MHKKTCDVLK EFLGLH

Mature (processed) human WFIKKN2 has the following amino acid sequence(SEQ ID NO: 30) with the follistatin domain indicated by singleunderline. Moreover, it will be appreciated that any of the 11 aminoacids prior to the first cysteine may be removed by processing orintentionally eliminated without substantial consequence, andpolypeptides comprising such slightly smaller polypeptides are furtherincluded.

(SEQ ID NO: 30) 1 LPPIRYSHAG ICPNDMNPNL WVDAQSTCRR ECETDQECET YEKCCPNVCG51 TKSCVAARYM DVKGKKGPVG MPKEATCDHF MCLQQGSECD IWDGQPVCKC 101KDRCEKEPSF TCASDGLTYY NRCYMDAEAC SKGITLAVVT CRYHFTWPNT 151 SPPPPETTMHPTTASPETPE LDMAAPALLN NPVHQSVTMG ETVSFLCDVV 201 GRPRPEITWE KQLEDRENVVMRPNHVRGNV VVTNIAQLVI YNAQLQDAGI 251 YTCTARNVAG VLRADFPLSV VRGHQAAATSESSPNGTAFP AAECLKPPDS 301 EDCGEEQTRW HFDAQANNCL TFTFGHCHRN LNHFETYEACMLACMSGPLA 351 ACSLPALQGP CKAYAPRWAY NSQTGQCQSF VYGGCEGNGN NFESREACEE401 SCPFPRGNQR CRACKPRQKL VTSFCRSDFV ILGRVSELTE EPDSGRALVT 451VDEVLKDEKM GLKFLGQEPL EVTLLHVDWA CPCPNVTVSE MPLIIMGEVD 501 GGMAMLRPDSFVGASSARRV RKLREVMHKK TCDVLKEFLG LH

In certain aspects, the disclosure includes polypeptides comprising theWFIKKN2_(FD) domain as set forth below (SEQ ID NO: 31), and, forexample, one or more heterologous polypeptides.

(SEQ ID NO: 31) 1 CDHFMCLQQG SECDIWDGQP VCKCKDRCEK EPSFTCASDG LTYYNRCYMD51 AEACSKGITL AVVTC

The murine WFIKKN2 (GASP1) precursor has the following amino acidsequence (SEQ ID NO: 32) (NCBI Ref Seq NP_861540.2), with the signalpeptide indicated by dotted underline and the follistatin domain(WFIKKN2_(FD)) indicated by solid underline.

(SEQ ID NO: 32)   1

 51 VDAQSTCKRE CETDQECETY EKCCPNVCGT KSCVAARYMD VKGKKGPVGM 101PKEATCDHFM CLQQGSECDI WDGQPVCKCK DRCEKEPSFT CASDGLTYYN 151RCFMDAEACS KGITLSVVTC RYHFTWPNTS PPPPETTVHP TTASPETLGL 201DMAAPALLNH PVHQSVTVGE TVSFLCDVVG RPRPELTWEK QLEDRENVVM 251RPNHVRGNVV VTNIAQLVIY NVQPQDAGIY TCTARNVAGV LRADFPLSVV 301RGGQARATSE SSLNGTAFPA TECLKPPDSE DCGEEQTRWH FDAQANNCLT 351FTFGHCHHNL NHFETYEACM LACMSGPLAI CSLPALQGPC KAYVPRWAYN 401SQTGLCQSFV YGGCEGNGNN FESREACEES CPFPRGNQHC RACKPRQKLV 451TSFCRSDFVI LGRVSELTEE QDSGRALVTV DEVLKDEKMG LKFLGREPLE 501VTLLHVDWTC PCPNVTVGET PLIIMGEVDG GMAMLRPDSF VGASSTRRVR 551KLREVMYKKT CDVLKDFLGL Q

Mature (processed) murine WFIKKN2 has the following amino acid sequence(SEQ ID NO: 33) with the follistatin domain indicated by singleunderline. Moreover, it will be appreciated that any of the 11 aminoacids prior to the first cysteine may be removed by processing orintentionally eliminated without substantial consequence, andpolypeptides comprising such slightly smaller polypeptides are furtherincluded.

(SEQ ID NO: 33) 1 LPPIRYSHAG ICPNDMNPNL WVDAQSTCKR ECETDQECET YEKCCPNVCG51 TKSCVAARYM DVKGKKGPVG MPKEATCDHF MCLQQGSECD IWDGQPVCKC 101KDRCEKEPSF TCASDGLTYY NRCFMDAEAC SKGITLSVVT CRYHFTWPNT 151 SPPPPETTVHPTTASPETLG LDMAAPALLN HPVHQSVTVG ETVSFLCDVV 201 GRPRPELTWE KQLEDRENVVMRPNHVRGNV VVTNIAQLVI YNVQPQDAGI 251 YTCTARNVAG VLRADFPLSV VRGGQARATSESSLNGTAFP ATECLKPPDS 301 EDCGEEQTRW HFDAQANNCL TFTFGHCHHN LNHFETYEACMLACMSGPLA 351 ICSLPALQGP CKAYVPRWAY NSQTGLCQSF VYGGCEGNGN NFESREACEE401 SCPFPRGNQH CRACKPRQKL VTSFCRSDFV ILGRVSELTE EQDSGRALVT 451VDEVLKDEKM GLKFLGREPL EVTLLHVDWT CPCPNVTVGE TPLIIMGEVD 501 GGMAMLRPDSFVGASSTRRV RKLREVMYKK TCDVLKDFLG LQ

If presented as WFIKKN2-Fc, this indicates that an Fc portion is fusedto the C-terminus of WFIKKN2, which may or may not include anintervening linker. The Fc in this instance may be any immunoglobulin Fcportion as that term is defined herein. If presented as WFIKKN2-G1Fc,this indicates that the Fc portion of IgG1 is fused at the C-terminus ofWFIKKN2. Unless indicated to the contrary, a protein described with thisnomenclature will represent a human WFIKKN2 protein.

In certain aspects, the disclosure provides follistatin-relatedpolypeptides and follistatin-related fusion proteins that may inhibitthe ligands myostatin, activin A, activin B, and/or GDF11. The term“follistatin-related polypeptide” is used herein to refer to a singlepolypeptide chain comprising an amino acid sequence that is at least80%, 85%, 90%, 95%, 96%, 97%, 98%, 995 or 100% identical to the aminoacid sequence of at least one follistatin domain (for example,FST_(FD1), FST_(FD2), FST_(FD3), FLRG_(FD1), FLRG_(FD2), WFIKKN1_(FD),or WFIKKN2_(FD)). The term “follistatin-related polypeptide” refers topolypeptides comprising any naturally occurring polypeptide product ofthe follistatin gene, FLRG (FSTL3, FSRP) gene, WFIKKN1 (GASP2) gene, orWFIKKN2 (GASP1) gene as well as any variants thereof (including mutants,fragments, fusions, and peptidomimetic forms) that retain a usefulactivity, including, for example, ligand binding (e.g., myostatin,GDF11, activin A, activin B). For example, follistatin-relatedpolypeptides include polypeptides comprising an amino acid sequence thatis at least about 80%, 85%, 90%, 95%, 97%, 99% or greater identity toSEQ ID NOs: 1-33.

The term “follistatin-related fusion polypeptide” refers to single-chainfusion proteins that comprise a follistatin-related polypeptidementioned above along with a heterologous portion in a single amino acidsequence. An amino acid sequence is understood to be heterologous tofollistatin, FLRG, WFIKKN1, or WFIKKN2 if it is not uniquely found inhuman FST315, (represented by SEQ ID NO: 3), human FLRG (represented bySEQ ID NO: 18), human WFIKKN1 (represented by SEQ ID NO: 25), or humanWFIKKN2 (represented by SEQ ID NO: 28). Many examples of heterologousportions are provided herein, and such heterologous portions may beimmediately adjacent, by amino acid sequence, to the follistatin-relatedpolypeptide portion of a fusion protein, or separated by interveningamino acid sequence, such as a linker or other sequence, and may bepositioned amino-terminal to or carboxy-terminal to a portion that is afollistatin-related polypeptide.

In certain embodiments, the follistatin-related fusion proteinsdescribed herein refer to an asymmetric heterodimeric fusion proteincomprising a polypeptide chain derived from a naturally occurringfollistatin-related polypeptide. Accordingly, in certain embodiments,the methods of the present disclosure are directed to the use of one ormore follistatin-related fusion proteins, including fusion proteinscomprising a single-arm follistatin-related polypeptide containing atleast one follistatin domain, optionally in combination with one or moresupportive therapies, to treat a variety of applicable disorders,particularly disorders that may be addressed by inhibition of theligands to which such follistatin-related fusion protein binds. Forexample, a follistatin-related fusion protein that binds to and inhibitsmyostatin, and optionally other ligands such as GDF11, activin A and/oractivin B may be used to increase skeletal muscle mass in a subject inneed thereof and/or treat or prevent skeletal muscle loss or a skeletalmuscle disorder in a subject in need thereof.

As shown herein, follistatin-related polypeptides may be more amenableto expression as active proteins when expressed in a monomeric form, butsuch proteins tend to be challenging to purify and also tend to have ashort serum residence time (half-life), which are both undesirable inthe therapeutic setting. The purification problem may be solved byincorporation of an interaction pair with intrinsic characteristics thatfacilitate purification, such as properties associated with a constantdomain portion (e.g., Fc portion) of an IgG that enable purification ofattached proteins by methods already known in the art. A commonmechanism for improving serum half-life is to express a polypeptide as ahomodimeric fusion protein with a constant domain portion of an IgG.However, follistatin-related polypeptides expressed as homodimericproteins (e.g. in an Fc fusion construct) may not be as active orwell-produced as the monomeric form. As demonstrated herein, the problemmay be solved by fusing the monomeric form to a half-life extendingmoiety, and surprisingly, this can be expeditiously achieved byexpressing such proteins as an asymmetric heterodimeric fusion proteinin which one member of a binding pair is fused to a follistatin-relatedpolypeptide and another member of the binding pair is fused to no othermoiety or a heterologous moiety, resulting in a highly activefollistatin-related polypeptide coupled with an improvement in serumhalf-life conferred by the binding pair.

The numbering of amino acids in the follistatin-related polypeptides isbased on the sequence of SEQ ID NOs: 1, 3, 15, 18, 26, 29, or 32,regardless of whether the native leader sequence is used.

The terms used in this specification generally have their ordinarymeanings in the art, within the context of this disclosure and in thespecific context where each term is used. Certain terms are discussedbelow or elsewhere in the specification, to provide additional guidanceto the practitioner in describing the compositions and methods of thedisclosure and how to make and use them. The scope or meaning of any useof a term will be apparent from the specific context in which the termis used.

“Homologous,” in all its grammatical forms and spelling variations,refers to the relationship between two proteins that possess a “commonevolutionary origin,” including proteins from superfamilies in the samespecies of organism, as well as homologous proteins from differentspecies of organism. Such proteins (and their encoding nucleic acids)have sequence homology, as reflected by their sequence similarity,whether in terms of percent identity or by the presence of specificresidues or motifs and conserved positions.

The term “sequence similarity,” in all its grammatical forms, refers tothe degree of identity or correspondence between nucleic acid or aminoacid sequences that may or may not share a common evolutionary origin.

However, in common usage and in the instant application, the term“homologous,” when modified with an adverb such as “highly,” may referto sequence similarity and may or may not relate to a commonevolutionary origin.

“Percent (%) sequence identity” with respect to a reference polypeptide(or nucleotide) sequence is defined as the percentage of amino acidresidues (or nucleic acids) in a candidate sequence that are identicalto the amino acid residues (or nucleic acids) in the referencepolypeptide (nucleotide) sequence, after aligning the sequences andintroducing gaps, if necessary, to achieve the maximum percent sequenceidentity, and not considering any conservative substitutions as part ofthe sequence identity. Alignment for purposes of determining percentamino acid sequence identity can be achieved in various ways that arewithin the skill in the art, for instance, using publicly availablecomputer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR)software. Those skilled in the art can determine appropriate parametersfor aligning sequences, including any algorithms needed to achievemaximal alignment over the full length of the sequences being compared.For purposes herein, however, % amino acid (nucleic acid) sequenceidentity values are generated using the sequence comparison computerprogram ALIGN-2. The ALIGN-2 sequence comparison computer program wasauthored by Genentech, Inc., and the source code has been filed withuser documentation in the U.S. Copyright Office, Washington D.C., 20559,where it is registered under U.S. Copyright Registration No. TXU510087.The ALIGN-2 program is publicly available from Genentech, Inc., SouthSan Francisco, Calif., or may be compiled from the source code. TheALIGN-2 program should be compiled for use on a UNIX operating system,including digital UNIX V4.0D. All sequence comparison parameters are setby the ALIGN-2 program and do not vary.

As used herein “does not substantially bind to X” is intended to meanthat an agent has a K_(D) that is greater than about 10⁻⁷, 10⁻⁶, 10⁻⁵,10⁻⁴ or greater (e.g., no detectable binding by the assay used todetermine the K_(D)) for “X”.

2. Follistatin-Related Fusion Polypeptides

In certain aspects, the disclosure concerns follistatin-related fusionpolypeptides comprising one or more follistatin domains (e.g., FST-Fcpolypeptides, FLRG-Fc polypeptides, WFIKKN1-Fc polypeptides, andWFIKKN2-Fc polypeptides). In certain embodiments, the polypeptidesdisclosed herein may form protein complexes comprising a firstpolypeptide covalently or non-covalently associated with a secondpolypeptide, wherein the first polypeptide comprises the amino acidsequence of a follistatin-related polypeptide and the amino acidsequence of a first member of an interaction pair; and the secondpolypeptide comprises the amino acid sequence of a second member of theinteraction pair, and wherein the second polypeptide does not comprise afollistatin-related polypeptide. The interaction pair may be any twopolypeptide sequences that interact to form a complex, particularly aheterodimeric complex although operative embodiments may also employ aninteraction pair that forms a homodimeric sequence. As described herein,one member of the interaction pair may be fused to a follistatin-relatedpolypeptide, such as a polypeptide comprising an amino acid sequencethat is at least 80%, 85%, 90%, 95%, 97%, 98%, 99% or 100% identical tothe sequence of any of SEQ ID NOs: 1-33. Preferably, the interactionpair is selected to confer an improved means of protein purification oran improved serum half-life, or to act as an adapter on to which anothermoiety, such as a polyethylene glycol moiety, is attached to provide animproved serum half-life relative to the monomeric form of thefollistatin-related polypeptide.

As described above, follistatin is characterized by four cysteine-richregions (i.e., FST_(ND), FST_(FD1), FST_(FD2), and FST_(FD3)) that arethought to mediate follistatin ligand binding. Similarly, FLRG ischaracterized by three cysteine-rich regions (i.e., FLRG_(ND),FLRG_(FD1), and FLRG_(FD2)) and WFIKKN1 or WFIKKN2 are eachcharacterized by a cysteine-rich region (WFIKKN1_(FD) or WFIKKN2_(FD))that are thought to mediate binding to myostatin, activins, or GDF11.Furthermore, researchers have demonstrated that polypeptide constructscomprising only one of the three follistatin domains in FST (e.g.,FST_(FD1)) retains strong affinity towards certain follistatin ligands(e.g., myostatin) and are biologically active in vivo. See Nakatani etal. (2008) FASEB J 22:477-487. Therefore, variant follistatin-relatedpolypeptides of the disclosure may comprise one or more active portionsof a follistatin protein. For example, constructs of the disclosure maybegin at a residue corresponding to amino acids 30-95 of SEQ ID NO: 1and end at a position corresponding to amino acids 316-344 of SEQ IDNO: 1. Other examples include constructs that begin at a position from30-95 of SEQ ID NO: 1 and end at a position corresponding to amino acids164-167 or 238-244.

The follistatin, FLRG, WFIKKN1, and WFIKKN2 polypeptide variantsdescribed herein may be combined in various ways with each other or withheterologous amino acid sequences. For example, variantfollistatin-related fusion proteins of the disclosure includepolypeptides that comprise one or more follistatin domains selected fromFST_(FD1) (amino acids 95-164 of SEQ ID NO: 1; i.e., SEQ ID NO: 7),FST_(FD2) (amino acids 168-239 of SEQ ID NO:1; i.e., SEQ ID NO: 8), orFST_(FD3) (amino acids 245-316 of SEQ ID NO:1; SEQ ID NO: 9) as well asproteins that comprise one or more follistatin domains selected from asequence at least 80%, 85%, 90%, 92%, 95%, 96%, 97%, 98% or 99%identical to FST_(FD1) (SEQ ID NO: 7), FST_(FD2) (SEQ ID NO: 8), orFST_(FD3) (SEQ ID NO: 9). Similarly, variant follistatin-related fusionproteins of the disclosure include polypeptides that comprise one ormore follistatin domains selected from FLRG_(FD1) (amino acids 99-167 ofSEQ ID NO: 18; i.e., SEQ ID NO: 22), FLRG_(FD2) (amino acids 171-243 ofSEQ ID NO: 18; i.e., SEQ ID NO: 23), WFIKKN1_(FD) (amino acids 93-157 ofSEQ ID NO: 26; i.e., SEQ ID NO: 28), or WFIKKN2_(FD) (amino acids111-175 of SEQ ID NO: 29; i.e., SEQ ID NO: 31) as well as proteins thatcomprise one or more follistatin domains selected from a sequence atleast 80%, 85%, 90%, 92%, 95%, 96%, 97%, 98% or 99% identical toFST_(FD1) (SEQ ID NO: 7), FST_(FD2) (SEQ ID NO: 8), FST_(FD3) (SEQ IDNO: 9), FLRG_(FD1) (SEQ ID NO: 22), FLRG_(FD2) (SEQ ID NO: 23),WFIKKN1_(FD) (SEQ ID NO: 28), or WFIKKN2_(FD) (SEQ ID NO: 31).

These follistatin domains may be combined in any order within a variantfollistatin-related polypeptide of the disclosure provided that suchrecombinant proteins maintain the desired activity including, forexample, follistatin ligand-binding activity (e.g., myostatin) andbiological activity (e.g., inducing muscle mass and/or muscle strength).Examples of such variant follistatin polypeptides include, for example,polypeptides having domain structures such asFST_(FD1)-FST_(FD2)-FST_(FD3), FST_(FD1)-FST_(FD3),FST_(FD1)-FST_(FD1)-FST_(FD3), FST_(FD1)-FST_(FD2), FST_(Fm)-FST_(FD1),FST_(ND)-FST_(FD1)-FST_(FD2)-FST_(FD3), FST_(ND)-FST_(FD1)-FST_(FD2),FST_(ND)-FST_(Fm)-FST_(Fm), FST_(ND)-FST_(FD1)-FST_(FD3),FST_(ND)-FST_(Fm)-FST_(Fm)-FST_(FD3), and polypeptides obtained byfusing other heterologous polypeptides to the N-termini or the C-terminiof these polypeptides. Examples of variant follistatin-relatedpolypeptides include, for example, polypeptides having domain structuressuch as FLRG_(FD1)-FLRG_(FD1), FLRG_(FD1)-FLRG_(FD2),FLRG_(FD1)-FLRG_(FD1)-FLRG_(FD2), FLRG_(ND)-FLRG_(FD1)-FLRG_(FD1),FLRG_(ND)-FST_(FD1)-FST_(FD2), and polypeptides obtained by fusing otherheterologous polypeptides to the N-termini or the C-termini of thesepolypeptides. These domains may be directly fused or linked via a linkerpolypeptide. Optionally, polypeptide linkers may be any sequence and maycomprise 1-100, 1-50, preferably 1-10, and more preferably 1-5 aminoacids. In certain aspects, preferred linkers contain no cysteine aminoacids or protease cleavage sites.

In some embodiments, follistatin variants of the disclosure have reducedor abolished binding affinity for one or more follistatin ligands. Incertain aspects, the disclosure provides follistatin variants that havereduced or abolished binding affinity for activin. In certain aspects,the disclosure provides follistatin variants that have reduced orabolished binding affinity for activin but retain high affinity formyostatin. In certain aspects, the disclosure provides follistatinvariants that have reduced or abolished binding affinity for GDF11 butretain high affinity for myostatin.

In certain embodiments, the present invention relates to antagonizing aligand of follistatin or another follistatin-related polypeptide, suchas an activin, GDF8 or GDF11, with a subject follistatin-related fusionpolypeptide. Thus, compositions and methods of the present disclosureare useful for treating disorders associated with abnormal orundesirably high activity of one or more such ligands.

The follistatin related polypeptides of the disclosure may comprise asignal sequence. The signal sequence can be a native signal sequence ofa follistatin precursor (e.g., amino acids 1-29 of SEQ ID NO:1), FLRGprecursor (e.g., amino acids 1-26 of SEQ ID NO: 18), WFIKKN1 precursor(e.g., amino acids 1-19 of SEQ ID NO: 26), WFIKKN2 precursor (e.g.,amino acids 1-34 of SEQ ID NO: 29), or a signal sequence from anotherprotein, such as tissue plasminogen activator (TPA) signal sequence or ahoney bee melatin (HBM) signal sequence.

Further N-linked glycosylation sites (N-X-S/T) may be added to afollistatin related polypeptide, and may increase the serum half-life ofa follistatin-related fusion protein. N-X-S/T sequences may be generallyintroduced at positions outside the ligand-binding pocket. N-X-S/Tsequences may be introduced into the linker between the follistatinsequence and the Fc or other fusion component. Such a site may beintroduced with minimal effort by introducing an N in the correctposition with respect to a pre-existing S or T, or by introducing an Sor T at a position corresponding to a pre-existing N. Any S that ispredicted to be glycosylated may be altered to a T without creating animmunogenic site, because of the protection afforded by theglycosylation. Likewise, any T that is predicted to be glycosylated maybe altered to an S. Accordingly, a follistatin-related polypeptidevariant may include one or more additional, non-endogenous N-linkedglycosylation consensus sequences.

In certain embodiments, the present disclosure contemplates makingfunctional variants by modifying the structure of a follistatin-relatedpolypeptide for such purposes as enhancing therapeutic efficacy, orstability (e.g., ex vivo shelf life and resistance to proteolyticdegradation in vivo). Modified follistatin-related polypeptides can alsobe produced, for instance, by amino acid substitution, deletion, oraddition. For instance, it is reasonable to expect that an isolatedreplacement of a leucine with an isoleucine or valine, an aspartate witha glutamate, a threonine with a serine, or a similar replacement of anamino acid with a structurally related amino acid (e.g., conservativemutations) will not have a major effect on the biological activity ofthe resulting molecule. Conservative replacements are those that takeplace within a family of amino acids that are related in their sidechains. Whether a change in the amino acid sequence of afollistatin-related polypeptide results in a functional homolog can bereadily determined by assessing the ability of the variantfollistatin-related polypeptide to produce a response in cells in afashion similar to the wild-type follistatin-related polypeptide, or tobind to one or more ligands, such as myostatin or activin in a mannersimilar to wild-type follistatin-related polypeptide.

In certain embodiments, the present invention contemplates specificmutations of the follistatin-related polypeptides so as to alter theglycosylation of the polypeptide. Such mutations may be selected so asto introduce or eliminate one or more glycosylation sites, such asO-linked or N-linked glycosylation sites. Asparagine-linkedglycosylation recognition sites generally comprise a tripeptidesequence, asparagine-X-threonine (where “X” is any amino acid) which isspecifically recognized by appropriate cellular glycosylation enzymes.The alteration may also be made by the addition of, or substitution by,one or more serine or threonine residues to the sequence of thewild-type follistatin-related polypeptide (for O-linked glycosylationsites). A variety of amino acid substitutions or deletions at one orboth of the first or third amino acid positions of a glycosylationrecognition site (and/or amino acid deletion at the second position)results in non-glycosylation at the modified tripeptide sequence.Another means of increasing the number of carbohydrate moieties on afollistatin-related polypeptide is by chemical or enzymatic coupling ofglycosides to the follistatin-related polypeptide. Depending on thecoupling mode used, the sugar(s) may be attached to (a) arginine andhistidine; (b) free carboxyl groups; (c) free sulfhydryl groups such asthose of cysteine; (d) free hydroxyl groups such as those of serine,threonine, or hydroxyproline; (e) aromatic residues such as those ofphenylalanine, tyrosine, or tryptophan; or (f) the amide group ofglutamine. These methods are described in WO 87/05330 published Sep. 11,1987, and in Aplin and Wriston (1981) CRC Crit. Rev. Biochem., pp.259-306, incorporated by reference herein. Removal of one or morecarbohydrate moieties may be accomplished chemically and/orenzymatically. Chemical deglycosylation may involve, for example,exposure of the follistatin-related polypeptide to the compoundtrifluoromethanesulfonic acid, or an equivalent compound. This treatmentresults in the cleavage of most or all sugars except the linking sugar(N-acetylglucosamine or N-acetylgalactosamine), while leaving the aminoacid sequence intact. Chemical deglycosylation is further described byHakimuddin et al. (1987) Arch. Biochem. Biophys. 259:52 and by Edge etal. (1981) Anal. Biochem. 118:131. Enzymatic cleavage of carbohydratemoieties on follistatin-related polypeptides can be achieved by the useof a variety of endo- and exo-glycosidases as described by Thotakura etal. (1987) Meth. Enzymol. 138:350. The sequence of a follistatin-relatedpolypeptide may be adjusted, as appropriate, depending on the type ofexpression system used, as mammalian, yeast, insect and plant cells mayall introduce differing glycosylation patterns that can be affected bythe amino acid sequence of the peptide. In general, follistatin-relatedfusion proteins for use in humans will be expressed in a mammalian cellline that provides proper glycosylation, such as HEK293, COS, or CHOcell lines, although other mammalian expression cell lines are expectedto be useful as well.

This disclosure further contemplates a method of generating variants,particularly sets of combinatorial variants of a follistatin-relatedpolypeptide, including, optionally, truncation variants; pools ofcombinatorial mutants are especially useful for identifying functionalvariant sequences. The purpose of screening such combinatorial librariesmay be to generate, for example, follistatin-related polypeptidevariants that have altered properties, such as altered pharmacokinetics,or altered ligand binding. A variety of screening assays are providedbelow, and such assays may be used to evaluate variants. For example, afollistatin-related polypeptide variant may be screened for ability tobind to a ligand such as activin A, B, C or E, GDF8 or GDF11, or toprevent binding of a ligand to a ligand receptor such as ActRIIA orActRIIB

The activity of a follistatin-related polypeptide or its variants mayalso be tested in a cell-based or in vivo assay. For example, the effectof a follistatin-related polypeptide variant on the expression of genesinvolved in muscle production may be assessed. This may, as needed, beperformed in the presence of one or more recombinant ligand proteins(e.g., myostatin or activin A), and cells may be transfected so as toproduce a follistatin-related polypeptide and/or variants thereof, andoptionally, a ligand. Likewise, a follistatin-related polypeptide may beadministered to a mouse or other animal, and one or more muscleproperties, such as muscle mass or strength may be assessed. Such assaysare well known and routine in the art. A responsive reporter gene may beused in such cell lines to monitor effects on downstream signaling.

Combinatorially-derived variants can be generated which have a selectivepotency relative to a naturally occurring follistatin-relatedpolypeptide. Such variant proteins, when expressed from recombinant DNAconstructs, can be used in gene therapy protocols. Likewise, mutagenesiscan give rise to variants which have intracellular half-livesdramatically different than the corresponding a wild-typefollistatin-related polypeptide. For example, the altered protein can berendered either more stable or less stable to proteolytic degradation orother processes which result in destruction of, or otherwiseinactivation of a native follistatin-related polypeptide. Such variants,and the genes which encode them, can be utilized to alterfollistatin-related polypeptide levels by modulating the half-life ofthe follistatin-related polypeptides.

In certain embodiments, the follistatin-related polypeptides of thedisclosure may further comprise post-translational modifications inaddition to any that are naturally present in the follistatin-relatedpolypeptides. Such modifications include, but are not limited to,acetylation, carboxylation, glycosylation, phosphorylation, lipidation,and acylation. As a result, the follistatin-related polypeptides maycontain non-amino acid elements, such as polyethylene glycols, lipids,poly- or mono-saccharide, and phosphates. Effects of such non-amino acidelements on the functionality of a follistatin-related polypeptide maybe tested as described herein for other follistatin-related polypeptidevariants. When a follistatin-related polypeptide is produced in cells bycleaving a nascent form of the follistatin-related polypeptide,post-translational processing may also be important for correct foldingand/or function of the protein. Different cells (such as CHO, COS, HeLa,MDCK, 293, WI38, NIH-3T3 or HEK293) have specific cellular machinery andcharacteristic mechanisms for such post-translational activities and maybe chosen to ensure the correct modification and processing of thefollistatin-related polypeptides.

In certain aspects, functional variants or modified forms of thefollistatin-related polypeptides include fusion proteins having at leasta portion of a follistatin-related polypeptide and one or more fusiondomains. Well known examples of such fusion domains include, but are notlimited to, polyhistidine, Glu-Glu, glutathione S transferase (GST),thioredoxin, protein A, protein G, an immunoglobulin heavy chainconstant region (e.g., an Fc), maltose binding protein (MBP), or humanserum albumin. A fusion domain may be selected so as to confer a desiredproperty. For example, some fusion domains are particularly useful forisolation of the fusion proteins by affinity chromatography. For thepurpose of affinity purification, relevant matrices for affinitychromatography, such as glutathione-, amylase-, and nickel- orcobalt-conjugated resins are used. Many of such matrices are availablein “kit” form, such as the Pharmacia GST purification system and theQIAexpress™ system (Qiagen) useful with (HIS₆) fusion partners. Asanother example, a fusion domain may be selected so as to facilitatedetection of the follistatin-related polypeptides. Examples of suchdetection domains include the various fluorescent proteins (e.g., GFP)as well as “epitope tags,” which are usually short peptide sequences forwhich a specific antibody is available. Well-known epitope tags forwhich specific monoclonal antibodies are readily available include FLAG,influenza virus haemagglutinin (HA), and c-myc tags. In some cases, thefusion domains have a protease cleavage site, such as for Factor Xa orthrombin, which allows the relevant protease to partially digest thefusion proteins and thereby liberate the recombinant proteins therefrom.The liberated proteins can then be isolated from the fusion domain bysubsequent chromatographic separation. In certain preferred embodiments,a follistatin-related polypeptide is fused with a domain that stabilizesthe follistatin-related polypeptide in vivo (a “stabilizer” domain). By“stabilizing” is meant anything that increases serum half-life,regardless of whether this is because of reduced degradation, reducedclearance by the kidney, or another pharmacokinetic effect. Fusions withthe Fc portion of an immunoglobulin are known to confer desirablepharmacokinetic properties on a wide range of proteins. Likewise,fusions to human serum albumin can confer desirable properties. Othertypes of fusion domains that may be selected include multimerizing(e.g., dimerizing, tetramerizing) domains and functional domains (thatconfer an additional biological function, such as further stimulation ofmuscle growth).

As specific examples, the present disclosure provides fusion proteinscomprising follistatin-related polypeptides fused to a polypeptidecomprising a constant domain of an immunoglobulin, such as a CH1, CH2 orCH3 domain of an immunoglobulin or an Fc. Fc domains derived from humanIgG1, IgG2, IgG3, and IgG4 are provided below. Other mutations are knownthat decrease either CDC or ADCC activity, and collectively, any ofthese variants are included in the disclosure and may be used asadvantageous components of a follistatin fusion protein. Optionally, theIgG1 Fc domain of SEQ ID NO: 34 has one or more mutations at residuessuch as Asp-265, Lys-322, and Asn-434 (numbered in accordance with thecorresponding full-length IgG1). In certain cases, the mutant Fc domainhaving one or more of these mutations (e.g., Asp-265 mutation) hasreduced ability of binding to the Fcγ receptor relative to a wildtype Fcdomain. In other cases, the mutant Fc domain having one or more of thesemutations (e.g., Asn-434 mutation) has increased ability of binding tothe MHC class I-related Fc-receptor (FcRN) relative to a wildtype Fcdomain.

An example of a native amino acid sequence that may be used for the Fcportion of human IgG1 (G1Fc) is shown below (SEQ ID NO: 34). Dottedunderline indicates the hinge region, and solid underline indicatespositions with naturally occurring variants. In part, the disclosureprovides polypeptides comprising amino acid sequences with 80%, 85%,90%, 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO: 34. Naturallyoccurring variants in G1Fc would include D134 and L136 according to thenumbering system used in SEQ ID NO: 34 (see Uniprot P01857).

(SEQ ID NO: 34)   1

 51 VKFNWYVDGV EVHNAKTKPR EEQYNSTYRV VSVLTVLHQD WLNGKEYKCK 101VSNKALPAPI EKTISKAKGQ PREPQVYTLP PSREEMTKNQ VSLTCLVKGF 151YPSDIAVEWE SNGQPENNYK TTPPVLDSDG SFFLYSKLTV DKSRWQQGNV 201FSCSVMHEAL HNHYTQKSLS LSPGK

An example of a native amino acid sequence that may be used for the Fcportion of human IgG2 (G2Fc) is shown below (SEQ ID NO: 35). Dottedunderline indicates the hinge region, solid underline indicatespositions with naturally occurring variants, and double underlineindicates positions where there are data base conflicts in the sequence(according to UniProt P01859). In part, the disclosure providespolypeptides comprising amino acid sequences with 80%, 85%, 90%, 95%,96%, 97%, 98%, or 99% identity to SEQ ID NO: 35.

(SEQ ID NO: 35)   1

 51 FNWYVDGVEV HNAKTKPREE QFNSTFRVVS VLTVVHQDWL NGKEYKCKVS 101NKGLPAPIEK TISKTKGQPR EPQVYTLPPS REEMTKNQVS LTCLVKGFYP 151SDIAVEWESN GQPENNYKTT PPMLDSDGSF FLYSKLTVDK SRWQQGNVFS 201CSVMHEALHN HYTQKSLSLS PGK

Two examples of amino acid sequences that may be used for the Fc portionof human IgG3 (G3Fc) are shown below. The hinge region in G3Fc can be upto four times as long as in other Fc chains and contains three identical15-residue segments preceded by a similar 17-residue segment. The firstG3Fc sequence shown below (SEQ ID NO: 36) contains a short hinge regionconsisting of a single 15-residue segment, whereas the second G3Fcsequence (SEQ ID NO: 37) contains a full-length hinge region. In eachcase, dotted underline indicates the hinge region, and solid underlineindicates positions with naturally occurring variants according toUniProt P01859. In part, the disclosure provides polypeptides comprisingamino acid sequences with 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99%identity to SEQ ID NOs: 36, 37.

(SEQ ID NO: 36)   1

 51 VSHEDPEVQF KWYVDGVEVH NAKTKPREEQ YNSTFRVVSV LTVLHQDWLN 101GKEYKCKVSN KALPAPIEKT ISKTKGQPRE PQVYTLPPSR EEMTKNQVSL 151TCLVKGFYPS DIAVEWESSG QPENNYNTTP PMLDSDGSFF LYSKLTVDKS 201RWQQGNIFSC SVMHEALHNR FTQKSLSLSP GK (SEQ ID NO: 37)   1

 51

101 EDPEVQFKWY VDGVEVHNAK TKPREEQYNS TFRVVSVLTV LHQDWLNGKE 151YKCKVSNKAL PAPIEKTISK TKGQPREPQV YTLPPSREEM TKNQVSLTCL 201VKGFYPSDIA VEWESSGQPE NNYNTTPPML DSDGSFFLYS KLTVDKSRWQ 251QGNIFSCSVM HEALHNRFTQ KSLSLSPGK

Naturally occurring variants in G3Fc (for example, see Uniprot P01859)include E68Q, V69, P76L, E79Q, Y81F, D97N, N100D, T124A, S169N, S169del,F221Y when converted to the numbering system used in SEQ ID NO: 36, andthe present disclosure provides fusion proteins comprising G3Fc domainscontaining one or more of these variations. In addition, the humanimmunoglobulin IgG3 gene (IGHG3) shows a structural polymorphismcharacterized by different hinge lengths [see Uniprot P01859].Specifically, variant WIS is lacking most of the V region and all of theCH1 region. It has an extra interchain disulfide bond at position 7 inaddition to the 11 normally present in the hinge region. Variant ZUClacks most of the V region, all of the CH1 region, and part of thehinge. Variant OMM may represent an allelic form or another gamma chainsubclass. The present disclosure provides additional fusion proteinscomprising G3Fc domains containing one or more of these variants.

An example of a native amino acid sequence that may be used for the Fcportion of human IgG4 (G4Fc) is shown below (SEQ ID NO: 38). Dottedunderline indicates the hinge region. In part, the disclosure providespolypeptides comprising amino acid sequences with 80%, 85%, 90%, 95%,96%, 97%, 98%, or 99% identity to SEQ ID NO: 38.

(SEQ ID NO: 38)   1

 51 EDPEVQFNWY VDGVEVHNAK TKPREEQFNS TYRVVSVLTV LHQDWLNGKE 101YKCKVSNKGL PSSIEKTISK AKGQPREPQV YTLPPSQEEM TKNQVSLTCL 151VKGFYPSDIA VEWESNGQPE NNYKTTPPVL DSDGSFFLYS RLTVDKSRWQ 201EGNVFSCSVM HEALHNHYTQ KSLSLSLGK

A variety of engineered mutations in the Fc domain are presented hereinwith respect to the G1Fc sequence (SEQ ID NO: 34), and analogousmutations in G2Fc, G3Fc, and G4Fc can be derived from their alignmentwith G1Fc in FIG. 1. Due to unequal hinge lengths, analogous Fcpositions based on isotype alignment (FIG. 1) possess different aminoacid numbers in SEQ ID NOs: 34, 35, 36, and 38.

A problem that arises in large-scale production of asymmetricimmunoglobulin-based proteins from a single cell line is known as the“chain association issue”. As confronted prominently in the productionof bispecific antibodies, the chain association issue concerns thechallenge of efficiently producing a desired multichain protein fromamong the multiple combinations that inherently result when differentheavy chains and/or light chains are produced in a single cell line[see, for example, Klein et al (2012) mAbs 4:653-663]. This problem ismost acute when two different heavy chains and two different lightchains are produced in the same cell, in which case there are a total of16 possible chain combinations (although some of these are identical)when only one is typically desired. Nevertheless, the same principleaccounts for diminished yield of a desired multichain fusion proteinthat incorporates only two different (asymmetric) heavy chains.

Various methods are known in the art that increase desired pairing ofFc-containing fusion polypeptide chains in a single cell line to producea preferred asymmetric fusion protein at acceptable yields [see, forexample, Klein et al (2012) mAbs 4:653-663]. Methods to obtain desiredpairing of Fc-containing chains include, but are not limited to,charge-based pairing (electrostatic steering), “knobs-into-holes” stericpairing, and SEEDbody pairing. See, for example, Ridgway et al (1996)Protein Eng 9:617-621; Merchant et al (1998) Nat Biotech 16:677-681;Davis et al (2010) Protein Eng Des Sel 23:195-202. As demonstratedherein, an asymmetric Fc fusion protein comprising a single WFIKKN2polypeptide arm, in which charge-based pairing promotes the correctmatching of asymmetric polypeptide chains, inhibits myostatin activityin a cell-based reporter gene assay with substantially greater potency(lower IC₅₀) than a symmetric Fc fusion protein comprising dual WFIKKN2polypeptide arms.

In certain embodiments, the disclosure provides desired pairing ofasymmetric Fc-containing polypeptide chains using Fc sequencesengineered to be complementary on the basis of charge pairing(electrostatic steering). One of a pair of Fc sequences withelectrostatic complementarity can be arbitrarily fused to thefollistatin-related polypeptide (e.g., follistatin polypeptide, FLRGpolypeptide, WFIKKN1 polypeptide, or WFIKKN2 polypeptide) of theconstruct, with or without an optional linker, to generate afollistatin-related fusion polypeptide. This single chain can becoexpressed in a cell of choice along with the Fc sequence complementaryto the first Fc to favor generation of the desired multichain construct(a follistatin-related fusion protein). In this example based onelectrostatic steering, SEQ ID NO: 39 [human G1Fc(E134K/D177K)] and SEQID NO: 40 [human G1Fc(K170D/K187D)] are examples of complementary Fcsequences in which the engineered amino acid substitutions are doubleunderlined, and the follistatin-related polypeptide of the construct canbe fused to either SEQ ID NO: 39 or SEQ ID NO: 40, but not both. Giventhe high degree of amino acid sequence identity between native hG1Fc,native hG2Fc, native hG3Fc, and native hG4Fc, it can be appreciated thatamino acid substitutions at corresponding positions in hG2Fc, hG3Fc, orhG4Fc (see FIG. 1) will generate complementary Fc pairs which may beused instead of the complementary hG1Fc pair below (SEQ ID NOs: 39, 40).

(SEQ ID NO: 39) 1 THTCPPCPAP ELLGGPSVFL FPPKPKDTLM ISRTPEVTCV VVDVSHEDPE51 VKFNWYVDGV EVHNAKTKPR EEQYNSTYRV VSVLTVLHQD WLNGKEYKCK 101 VSNKALPAPIEKTISKAKGQ PREPQVYTLP PSRKEMTKNQ VSLTCLVKGF 151 YPSDIAVEWE SNGQPENNYKTTPPVLKSDG SFFLYSKLTV DKSRWQQGNV 201 FSCSVMHEAL HNHYTQKSLS LSPGK (SEQ IDNO: 40) 1 THTCPPCPAP ELLGGPSVFL FPPKPKDTLM ISRTPEVTCV VVDVSHEDPE 51VKFNWYVDGV EVHNAKTKPR EEQYNSTYRV VSVLTVLHQD WLNGKEYKCK 101 VSNKALPAPIEKTISKAKGQ PREPQVYTLP PSREEMTKNQ VSLTCLVKGF 151 YPSDIAVEWESNGQPENNYD TTPPVLDSDG SFFLYSDLTV DKSRWQQGNV 201 FSCSVMHEAL HNHYTQKSLSLSPGK

In part, the disclosure provides desired pairing of asymmetricFc-containing polypeptide chains using Fc sequences engineered forsteric complementarity. In part, the disclosure providesknobs-into-holes pairing as an example of steric complementarity. One ofa pair of Fc sequences with steric complementarity can be arbitrarilyfused to the follistatin-related polypeptide (e.g., follistatinpolypeptide, FLRG polypeptide, WFIKKN1 polypeptide, or WFIKKN2polypeptide) of the construct, with or without an optional linker, togenerate a follistatin-related fusion polypeptide. This single chain canbe coexpressed in a cell of choice along with the Fc sequencecomplementary to the first Fc to favor generation of the desiredmultichain construct. In this example based on knobs-into-holes pairing,SEQ ID NO: 41 [human G1Fc(T144Y)] and SEQ ID NO: 42 [human G1Fc(Y185T)]are examples of complementary Fc sequences in which the engineered aminoacid substitutions are double underlined, and the follistatin-relatedpolypeptide of the construct can be fused to either SEQ ID NO: 41 or SEQID NO: 42, but not both. Given the high degree of amino acid sequenceidentity between native hG1Fc, native hG2Fc, native hG3Fc, and nativehG4Fc, it can be appreciated that amino acid substitutions atcorresponding positions in hG2Fc, hG3Fc, or hG4Fc (see FIG. 1) willgenerate complementary Fc pairs which may be used instead of thecomplementary hG1Fc pair below (SEQ ID NOs: 41, 42).

(SEQ ID NO: 41) 1 THTCPPCPAP ELLGGPSVFL FPPKPKDTLM ISRTPEVTCV VVDVSHEDPE51 VKFNWYVDGV EVHNAKTKPR EEQYNSTYRV VSVLTVLHQD WLNGKEYKCK 101 VSNKALPAPIEKTISKAKGQ PREPQVYTLP PSREEMTKNQ VSLYCLVKGF 151 YPSDIAVEWE SNGQPENNYKTTPPVLDSDG SFFLYSKLTV DKSRWQQGNV 201 FSCSVMHEAL HNHYTQKSLS LSPGK (SEQ IDNO: 42) 1 THTCPPCPAP ELLGGPSVFL FPPKPKDTLM ISRTPEVTCV VVDVSHEDPE 51VKFNWYVDGV EVHNAKTKPR EEQYNSTYRV VSVLTVLHQD WLNGKEYKCK 101 VSNKALPAPIEKTISKAKGQ PREPQVYTLP PSREEMTKNQ VSLTCLVKGF 151 YPSDIAVEWE SNGQPENNYKTTPPVLDSDG SFFLTSKLTV DKSRWQQGNV 201 FSCSVMHEAL HNHYTQKSLS LSPGK

An example of Fc complementarity based on knobs-into-holes pairingcombined with an engineered disulfide bond is disclosed in SEQ ID NO: 43[hG1Fc(S132C/T144W)] and SEQ ID NO: 44 [hG1Fc(Y127C/T144S/L146A/Y185V)].The engineered amino acid substitutions in these sequences are doubleunderlined, and the follistatin-related polypeptide of the construct canbe fused to either SEQ ID NO: 43 or SEQ ID NO: 44, but not both. Giventhe high degree of amino acid sequence identity between native hG1Fc,native hG2Fc, native hG3Fc, and native hG4Fc, it can be appreciated thatamino acid substitutions at corresponding positions in hG2Fc, hG3Fc, orhG4Fc (see FIG. 1) will generate complementary Fc pairs which may beused instead of the complementary hG1Fc pair below (SEQ ID NOs: 43, 44).

(SEQ ID NO: 43) 1 THTCPPCPAP ELLGGPSVFL FPPKPKDTLM ISRTPEVTCV VVDVSHEDPE51 VKFNWYVDGV EVHNAKTKPR EEQYNSTYRV VSVLTVLHQD WLNGKEYKCK 101 VSNKALPAPIEKTISKAKGQ PREPQVYTLP PCREEMTKNQ VSLWCLVKGF 151 YPSDIAVEWE SNGQPENNYKTTPPVLDSDG SFFLYSKLTV DKSRWQQGNV 201 FSCSVMHEAL HNHYTQKSLS LSPGK (SEQ IDNO: 44) 1 THTCPPCPAP ELLGGPSVFL FPPKPKDTLM ISRTPEVTCV VVDVSHEDPE 51VKFNWYVDGV EVHNAKTKPR EEQYNSTYRV VSVLTVLHQD WLNGKEYKCK 101 VSNKALPAPIEKTISKAKGQ PREPQVCTLP PSREEMTKNQ VSLSCAVKGF 151 YPSDIAVEWE SNGQPENNYKTTPPVLDSDG SFFLVSKLTV DKSRWQQGNV 201 FSCSVMHEAL HNHYTQKSLS LSPGK

In part, the disclosure provides desired pairing of asymmetricFc-containing polypeptide chains using Fc sequences engineered togenerate interdigitating β-strand segments of human IgG and IgA C_(H)3domains. Such methods include the use of strand-exchange engineereddomain (SEED) C_(H)3 heterodimers allowing the formation of SEEDbodyfusion proteins [see, for example, Davis et al (2010) Protein Eng DesignSel 23:195-202]. One of a pair of Fc sequences with SEEDbodycomplementarity can be arbitrarily fused to the follistatin-relatedpolypeptide (e.g., follistatin polypeptide, FLRG polypeptide, WFIKKN1polypeptide, or WFIKKN2 polypeptide) of the construct, with or withoutan optional linker, to generate a follistatin-related fusionpolypeptide. This single chain can be coexpressed in a cell of choicealong with the Fc sequence complementary to the first Fc to favorgeneration of the desired multichain construct. In this example based onSEEDbody (Sb) pairing, SEQ ID NO: 45 [hG1Fc(Sb_(AG))] and SEQ ID NO: 46[hG1Fc(Sb_(GA))] are examples of complementary IgG Fc sequences in whichthe engineered amino acid substitutions from IgA Fc are doubleunderlined, and the follistatin-related polypeptide of the construct canbe fused to either SEQ ID NO: 45 or SEQ ID NO: 46, but not both. Giventhe high degree of amino acid sequence identity between native hG1Fe,native hG2Fc, native hG3Fc, and native hG4Fc, it can be appreciated thatamino acid substitutions at corresponding positions in hG1Fe, hG2Fc,hG3Fc, or hG4Fc (see FIG. 1) will generate an Fc monomer which may beused in the complementary IgG-IgA pair below (SEQ ID NOs: 45, 46).

(SEQ ID NO: 45) 1 THTCPPCPAP ELLGGPSVFL FPPKPKDTLM ISRTPEVTCV VVDVSHEDPE51 VKFNWYVDGV EVHNAKTKPR EEQYNSTYRV VSVLTVLHQD WLNGKEYKCK 101 VSNKALPAPIEKTISKAKGQ PFRPEVHLLP PSREEMTKNQ VSLTCLARGF 151 YPKDIAVEWE SNGQPENNYKTTPSRQEPSQ GTTTFAVTSK LTVDKSRWQQ 201 GNVFSCSVMH EALHNHYTQK TISLSPGK (SEQID NO: 46) 1 THTCPPCPAP ELLGGPSVFL FPPKPKDTLM ISRTPEVTCV VVDVSHEDPE 51VKFNWYVDGV EVHNAKTKPR EEQYNSTYRV VSVLTVLHQD WLNGKEYKCK 101 VSNKALPAPIEKTISKAKGQ PREPQVYTLP PPSEELALNE LVTLTCLVKG 151 FYPSDIAVEWESNGQELPRE KYLTWAPVLD SDGSFFLYSI LRVAAEDWKK 201 GDTFSCSVMH EALHNHYTQKSLDRSPGK

It is understood that different elements of the fusion proteins may bearranged in any manner that is consistent with the desiredfunctionality. For example, a follistatin-related polypeptide may beplaced C-terminal to a heterologous domain, or, alternatively, aheterologous domain may be placed C-terminal to a follistatin-relatedpolypeptide. The follistatin-related polypeptide domain and theheterologous domain need not be adjacent in a fusion protein, andadditional domains or amino acid sequences may be included C- orN-terminal to either domain or between the domains.

As used herein, the term “immunoglobulin Fc domain” or simply “Fc” isunderstood to mean the carboxyl-terminal portion of an immunoglobulinchain constant region, preferably an immunoglobulin heavy chain constantregion, or a portion thereof. For example, an immunoglobulin Fc regionmay comprise 1) a CH1 domain, a CH2 domain, and a CH3 domain, 2) a CH1domain and a CH2 domain, 3) a CH1 domain and a CH3 domain, 4) a CH2domain and a CH3 domain, or 5) a combination of two or more domains andan immunoglobulin hinge region. In a preferred embodiment theimmunoglobulin Fc region comprises at least an immunoglobulin hingeregion a CH2 domain and a CH3 domain, and preferably lacks the CH1domain. It is also understood that a follistatin polypeptide maycomprise only a domain of an immunoglobulin, such as a CH1 domain, a CH2domain or a CH3 domain. Many of these domains confer desirablepharmacokinetic properties as well as dimerization or higher ordermultimerization.

In one embodiment, the class of immunoglobulin from which the heavychain constant region is derived is IgG (Igγ) (γ subclasses 1, 2, 3, or4). Other classes of immunoglobulin, IgA (Igα), IgD (Igδ), IgE (Igε) andIgM (Igμ), may be used. The choice of appropriate immunoglobulin heavychain constant region is discussed in detail in U.S. Pat. Nos. 5,541,087and 5,726,044. The choice of particular immunoglobulin heavy chainconstant region sequences from certain immunoglobulin classes andsubclasses to achieve a particular result is considered to be within thelevel of skill in the art. The portion of the DNA construct encoding theimmunoglobulin Fc region preferably comprises at least a portion of ahinge domain, and preferably at least a portion of a CH₃ domain of Fcgamma or the homologous domains in any of IgA, IgD, IgE, or IgM.

Furthermore, it is contemplated that substitution or deletion of aminoacids within the immunoglobulin heavy chain constant regions may beuseful in the practice of the methods and compositions disclosed herein.One example would be to introduce amino acid substitutions in the upperCH2 region to create an Fc variant with reduced affinity for Fcreceptors (Cole et al. (1997) J. Immunol. 159:3613). Additionally, inmany instances, the C-terminal lysine, or K, will be removed and thusany of the polypeptides described herein may omit the C-terminal K thatis found in an Fc domain, such as those shown in SEQ ID NOs: 34-46.

In certain embodiments, the follistatin-related polypeptides of thepresent disclosure contain one or more modifications that are capable ofstabilizing the follistatin-related polypeptides. For example, suchmodifications enhance the in vitro half-life of the follistatin-relatedpolypeptides, enhance circulatory half-life of the follistatin-relatedpolypeptides or reducing proteolytic degradation of thefollistatin-related polypeptides. Such stabilizing modificationsinclude, but are not limited to, fusion proteins (including, forexample, fusion proteins comprising a follistatin-related polypeptideand a stabilizer domain), modifications of a glycosylation site(including, for example, addition of a glycosylation site to afollistatin-related polypeptide), and modifications of carbohydratemoiety (including, for example, removal of carbohydrate moieties from afollistatin-related polypeptide). In the case of fusion proteins, afollistatin-related polypeptide is fused to a stabilizer domain such asan IgG molecule (e.g., an Fc domain). As used herein, the term“stabilizer domain” not only refers to a fusion domain (e.g., Fc) as inthe case of fusion proteins, but also includes nonproteinaceousmodifications such as a carbohydrate moiety, or nonproteinaceouspolymer, such as polyethylene glycol.

In certain embodiments, the present invention makes available isolatedand/or purified forms of the follistatin-related polypeptides, which areisolated from, or otherwise substantially free of, other proteins. Incertain embodiments, the present invention facilitates purification oftherapeutically active follistatin-related polypeptides by attachment ofan interaction pair (for example, an Fc domain) possessing propertiesadvantageous for purification.

In certain embodiments, follistatin-related polypeptides (unmodified ormodified) of the disclosure can be produced by a variety of art-knowntechniques. For example, such follistatin-related polypeptides can besynthesized using standard protein chemistry techniques such as thosedescribed in Bodansky, M. Principles of Peptide Synthesis, SpringerVerlag, Berlin (1993) and Grant G. A. (ed.), Synthetic Peptides: AUser's Guide, W. H. Freeman and Company, New York (1992). In addition,automated peptide synthesizers are commercially available (e.g.,Advanced ChemTech Model 396; Milligen/Biosearch 9600). Alternatively,the follistatin-related polypeptides, fragments or variants thereof maybe recombinantly produced using various expression systems (e.g., E.coli, Chinese Hamster Ovary cells, COS cells, baculovirus) as is wellknown in the art (also see below). In a further embodiment, the modifiedor unmodified follistatin-related polypeptides may be produced bydigestion of naturally occurring or recombinantly produced full-lengthfollistatin-related polypeptides by using, for example, a protease,e.g., trypsin, thermolysin, chymotrypsin, pepsin, or paired basic aminoacid converting enzyme (PACE). Computer analysis (using a commerciallyavailable software, e.g., MacVector, Omega, PCGene, MolecularSimulation, Inc.) can be used to identify proteolytic cleavage sites.Alternatively, such follistatin-related polypeptides may be producedfrom naturally occurring or recombinantly produced full-lengthfollistatin-related polypeptides such as standard techniques known inthe art, such as by chemical cleavage (e.g., cyanogen bromide,hydroxylamine).

Any of the follistatin-related polypeptides disclosed herein may becombined with one or more additional follistatin-related polypeptides ofthe disclosure to achieve a desired effect such as treating afollistatin-related disorder (e.g., increase muscle mass and/or strengthin a subject in need thereof, treat or prevent muscle loss in a subjectin need thereof, treat or prevent one or more complications of muscleloss in a subject in need thereof; increase hemoglobin concentration orred blood cell count in a subject in need thereof, treat or preventinadequate hemoglobin concentration or red blood cell count in a subjectin need thereof, treat or prevent one or more complications ofinadequate hemoglobin concentration or red blood cell count in a subjectin need thereof; increase bone mass and/or strength in a subject in needthereof, treat or prevent bone loss or fragility in a subject in needthereof, treat or prevent one or more complications of bone loss orfragility in a subject in need thereof; or treat cancer in a subject inneed thereof). For example, a follistatin polypeptide disclosed hereincan be used in combination with i) one or more additional FLRGpolypeptides disclosed herein, ii) one or more WFIKKN1 polypeptidesdisclosed herein, and/or iii) one or more WFIKKN2 polypeptides disclosedherein.

3. Nucleic Acids Encoding Follistatin-Related Polypeptides

In certain aspects, the invention provides isolated and/or recombinantnucleic acids encoding any of the follistatin-related polypeptidesdisclosed herein. The subject nucleic acids may be single stranded ordouble stranded. Such nucleic acids may be DNA or RNA molecules. Thesenucleic acids are may be used, for example, in methods for makingfollistatin-related polypeptides.

For example, the following sequence encodes a naturally occurring humanfollistatin precursor polypeptide (SEQ ID NO: 47) (nucleotides 359-1390of NCBI Reference Sequence: NM_013409.2):

(SEQ ID NO: 47) AtggtccgcgcgaggcaccagccgggtgggctttgcctcctgctgctgctgctctgccagttcatggaggaccgcagtgcccaggctgggaactgctggctccgtcaagcgaagaacggccgctgccaggtcctgtacaagaccgaactgagcaaggaggagtgctgcagcaccggccggctgagcacctcgtggaccgaggaggacgtgaatgacaacacactcttcaagtggatgattttcaacgggggcgcccccaactgcatcccctgtaaagaaacgtgtgagaacgtggactgtggacctgggaaaaaatgccgaatgaacaagaagaacaaaccccgctgcgtctgcgccccggattgttccaacatcacctggaagggtccagtctgcgggctggatgggaaaacctaccgcaatgaatgtgcactcctaaaggcaagatgtaaagagcagccagaactggaagtccagtaccaaggcagatgtaaaaagacttgtcgggatgttttctgtccaggcagctccacatgtgtggtggaccagaccaataatgcctactgtgtgacctgtaatcggatttgcccagagcctgcttcctctgagcaatatctctgtgggaatgatggagtcacctactccagtgcctgccacctgagaaaggctacctgcctgctgggcagatctattggattagcctatgagggaaagtgtatcaaagcaaagtcctgtgaagatatccagtgcactggtgggaaaaaatgtttatgggatttcaaggttgggagaggccggtgttccctctgtgatgagctgtgccctgacagtaagtcggatgagcctgtctgtgccagtgacaatgccacttatgccagcgagtgtgccatgaaggaagctgcctgctcctcaggtgtgctactggaagtaaagcactccggatcttgcaactccatttcggaagacaccgaggaagaggaggaagatgaagaccaggactacagctttcctatatcttctattctagagtggThe following sequence encodes the mature FST315 polypeptide (SEQ ID NO:48).

(SEQ ID NO: 48) GggaactgctggctccgtcaagcgaagaacggccgctgccaggtcctgtacaagaccgaactgagcaaggaggagtgctgcagcaccggccggctgagcacctcgtggaccgaggaggacgtgaatgacaacacactcttcaagtggatgattttcaacgggggcgcccccaactgcatcccctgtaaagaaacgtgtgagaacgtggactgtggacctgggaaaaaatgccgaatgaacaagaagaacaaaccccgctgcgtctgcgccccggattgttccaacatcacctggaagggtccagtctgcgggctggatgggaaaacctaccgcaatgaatgtgcactcctaaaggcaagatgtaaagagcagccagaactggaagtccagtaccaaggcagatgtaaaaagacttgtcgggatgttttctgtccaggcagctccacatgtgtggtggaccagaccaataatgcctactgtgtgacctgtaatcggatttgcccagagcctgcttcctctgagcaatatctctgtgggaatgatggagtcacctactccagtgcctgccacctgagaaaggctacctgcctgctgggcagatctattggattagcctatgagggaaagtgtatcaaagcaaagtcctgtgaagatatccagtgcactggtgggaaaaaatgtttatgggatttcaaggttgggagaggccggtgttccctctgtgatgagctgtgccctgacagtaagtcggatgagcctgtctgtgccagtgacaatgccacttatgccagcgagtgtgccatgaaggaagctgcctgctcctcaggtgtgctactggaagtaaagcactccggatcttgcaactccatttcggaagacaccgaggaagaggaggaagatgaagaccaggactacagctttcctatatcttctattctagagtggThe following sequence encodes the FST288 polypeptide (SEQ ID NO: 49).

(SEQ ID NO: 49) gggaactgctggctccgtcaagcgaagaacggccgctgccaggtcctgtacaagaccgaactgagcaaggaggagtgctgcagcaccggccggctgagcacctcgtggaccgaggaggacgtgaatgacaacacactcttcaagtggatgattttcaacgggggcgcccccaactgcatcccctgtaaagaaacgtgtgagaacgtggactgtggacctgggaaaaaatgccgaatgaacaagaagaacaaaccccgctgcgtctgcgccccggattgttccaacatcacctggaagggtccagtctgcgggctggatgggaaaacctaccgcaatgaatgtgcactcctaaaggcaagatgtaaagagcagccagaactggaagtccagtaccaaggcagatgtaaaaagacttgtcgggatgttttctgtccaggcagctccacatgtgtggtggaccagaccaataatgcctactgtgtgacctgtaatcggatttgcccagagcctgcttcctctgagcaatatctctgtgggaatgatggagtcacctactccagtgcctgccacctgagaaaggctacctgcctgctgggcagatctattggattagcctatgagggaaagtgtatcaaagcaaagtcctgtgaagatatccagtgcactggtgggaaaaaatgtttatgggatttcaaggttgggagaggccggtgttccctctgtgatgagctgtgccctgacagtaagtcggatgagcctgtctgtgccagtgacaatgccacttatgccagcgagtgtgccatgaaggaagctgcctgctcctcaggtgtgctactggaagtaaagcact ccggatcttgcaacThe following sequence encodes the mature FST291 polypeptide (SEQ ID NO:50).

(SEQ ID NO: 50) Gggaactgctggctccgtcaagcgaagaacggccgctgccaggtcctgtacaagaccgaactgagcaaggaggagtgctgcagcaccggccggctgagcacctcgtggaccgaggaggacgtgaatgacaacacactcttcaagtggatgattttcaacgggggcgcccccaactgcatcccctgtaaagaaacgtgtgagaacgtggactgtggacctgggaaaaaatgccgaatgaacaagaagaacaaaccccgctgcgtctgcgccccggattgttccaacatcacctggaagggtccagtctgcgggctggatgggaaaacctaccgcaatgaatgtgcactcctaaaggcaagatgtaaagagcagccagaactggaagtccagtaccaaggcagatgtaaaaagacttgtcgggatgttttctgtccaggcagctccacatgtgtggtggaccagaccaataatgcctactgtgtgacctgtaatcggatttgcccagagcctgcttcctctgagcaatatctctgtgggaatgatggagtcacctactccagtgcctgccacctgagaaaggctacctgcctgctgggcagatctattggattagcctatgagggaaagtgtatcaaagcaaagtcctgtgaagatatccagtgcactggtgggaaaaaatgtttatgggatttcaaggttgggagaggccggtgttccctctgtgatgagctgtgccctgacagtaagtcggatgagcctgtctgtgccagtgacaatgccacttatgccagcgagtgtgccatgaaggaagctgcctgctcctcaggtgtgctactggaagtaaagcactccggatcttgcaactccatttcgtgg

For example, the following sequence (SEQ ID NO: 51) encodes a naturallyoccurring human FLRG precursor polypeptide (nucleotides 36-824 of NCBIReference Sequence NM_005860.2). Nucleotides encoding the signalsequence are underlined.

(SEQ ID NO: 51) 1 atgcgtcccg gggcgccagg gccactctgg cctctgccct ggggggccct51 ggcttgggcc gtgggcttcg tgagctccat gggctcgggg aaccccgcgc 101 ccggtggtgtttgctggctc cagcagggcc aggaggccac ctgcagcctg 151 gtgctccaga ctgatgtcacccgggccgag tgctgtgcct ccggcaacat 201 tgacaccgcc tggtccaacc tcacccacccggggaacaag atcaacctcc 251 tcggcttctt gggccttgtc cactgccttc cctgcaaagattcgtgcgac 301 ggcgtggagt gcggcccggg caaggcgtgc cgcatgctgg ggggccgccc351 gcgctgcgag tgcgcgcccg actgctcggg gctcccggcg cggctgcagg 401tctgcggctc agacggcgcc acctaccgcg acgagtgcga gctgcgcgcc 451 gcgcgctgccgcggccaccc ggacctgagc gtcatgtacc ggggccgctg 501 ccgcaagtcc tgtgagcacgtggtgtgccc gcggccacag tcgtgcgtcg 551 tggaccagac gggcagcgcc cactgcgtggtgtgtcgagc ggcgccctgc 601 cctgtgccct ccagccccgg ccaggagctt tgcggcaacaacaacgtcac 651 ctacatctcc tcgtgccaca tgcgccaggc cacctgcttc ctgggccgct701 ccatcggcgt gcgccacgcg ggcagctgcg caggcacccc tgaggagccg 751ccaggtggtg agtctgcaga agaggaagag aacttcgtg

The following sequence (SEQ ID NO: 52) encodes a mature human FLRGpolypeptide (nucleotides 114-824 of NCBI Reference SequenceNM_005860.2).

(SEQ ID NO: 52) 1 atgggctcgg ggaaccccgc gcccggtggt gtttgctggc tccagcaggg51 ccaggaggcc acctgcagcc tggtgctcca gactgatgtc acccgggccg 101 agtgctgtgcctccggcaac attgacaccg cctggtccaa cctcacccac 151 ccggggaaca agatcaacctcctcggcttc ttgggccttg tccactgcct 201 tccctgcaaa gattcgtgcg acggcgtggagtgcggcccg ggcaaggcgt 251 gccgcatgct ggggggccgc ccgcgctgcg agtgcgcgcccgactgctcg 301 gggctcccgg cgcggctgca ggtctgcggc tcagacggcg ccacctaccg351 cgacgagtgc gagctgcgcg ccgcgcgctg ccgcggccac ccggacctga 401gcgtcatgta ccggggccgc tgccgcaagt cctgtgagca cgtggtgtgc 451 ccgcggccacagtcgtgcgt cgtggaccag acgggcagcg cccactgcgt 501 ggtgtgtcga gcggcgccctgccctgtgcc ctccagcccc ggccaggagc 551 tttgcggcaa caacaacgtc acctacatctcctcgtgcca catgcgccag 601 gccacctgct tcctgggccg ctccatcggc gtgcgccacgcgggcagctg 651 cgcaggcacc cctgaggagc cgccaggtgg tgagtctgca gaagaggaag701 agaacttcgt g

For example, the following sequence (SEQ ID NO: 53) encodes a naturallyoccurring human WFIKKN1 precursor polypeptide (nucleotides 243-1886 ofNCBI Reference Sequence NM_053284.2). Nucleotides encoding the signalsequence are underlined.

(SEQ ID NO: 53)    1atgcccgccc tacgtccact cctgccgctc ctgctcctcc tccggctgac   51ctcgggggct ggcttgctgc cagggctggg gagccacccg ggcgtgtgcc  101ccaaccagct cagccccaac ctgtgggtgg acgcccagag cacctgtgag  151cgcgagtgta gcagggacca ggactgtgcg gctgctgaga agtgctgcat  201caacgtgtgt ggactgcaca gctgcgtggc agcacgcttc cccggcagcc  251cagctgcgcc gacgacagcg gcctcctgcg agggctttgt gtgcccacag  301cagggctcgg actgcgacat ctgggacggg cagcccgtgt gccgctgccg  351cgaccgctgt gagaaggagc ccagcttcac ctgcgcctcg gacggcctca  401cctactacaa ccgctgctat atggacgccg aggcctgcct gcggggcctg  451cacctccaca tcgtgccctg caagcacgtg ctcagctggc cgcccagcag  501cccggggccg ccggagacca ctgcccgccc cacacctggg gccgcgcccg  551tgcctcctgc cctgtacagc agcccctccc cacaggcggt gcaggttggg  601ggtacggcca gcctccactg cgacgtcagc ggccgcccgc cgcctgctgt  651gacctgggag aagcagagtc accagcgaga gaacctgatc atgcgccctg  701atcagatgta tggcaacgtg gtggtcacca gcatcgggca gctggtgctc  751tacaacgcgc ggcccgaaga cgccggcctg tacacctgca ccgcgcgcaa  801cgctgctggg ctgctgcggg ctgacttccc actctctgtg gtccagcgag  851agccggccag ggacgcagcc cccagcatcc cagccccggc cgagtgcctg  901ccggatgtgc aggcctgcac gggccccact tccccacacc ttgtcctctg  951gcactacgac ccgcagcggg gcggctgcat gaccttcccg gcccgtggct 1001gtgatggggc ggcccgcggc tttgagacct acgaggcatg ccagcaggcc 1051tgtgcccgcg gccccggcga cgcctgcgtg ctgcctgccg tgcagggccc 1101ctgccggggc tgggagccgc gctgggccta cagcccgctg ctgcagcagt 1151gccatccctt cgtgtacggt ggctgcgagg gcaacggcaa caacttccac 1201agccgcgaga gctgcgagga tgcctgcccc gtgccgcgca caccgccctg 1251ccgcgcctgc cgcctccgga gcaagctggc gctgagcctg tgccgcagcg 1301acttcgccat cgtggggcgg ctcacggagg tgctggagga gcccgaggcc 1351gccggcggca tcgcccgcgt ggcgctcgag gacgtgctca aggatgacaa 1401gatgggcctc aagttcttgg gcaccaagta cctggaggtg acgctgagtg 1451gcatggactg ggcctgcccc tgccccaaca tgacggcggg cgacgggccg 1501ctggtcatca tgggtgaggt gcgcgatggc gtggccgtgc tggacgccgg 1551cagctacgtc cgcgccgcca gcgagaagcg cgtcaagaag atcttggagc 1601tgctggagaa gcaggcctgc gagctgctca accgcttcca ggac

The following sequence (SEQ ID NO: 54) encodes a mature human WFIKKN1polypeptide (nucleotides 300-1886 of NCBI Ref Seq NM_053284.2).

(SEQ ID NO: 54) 1 gctggcttgc tgccagggct ggggagccac ccgggcgtgt gccccaacca51 gctcagcccc aacctgtggg tggacgccca gagcacctgt gagcgcgagt 101 gtagcagggaccaggactgt gcggctgctg agaagtgctg catcaacgtg 151 tgtggactgc acagctgcgtggcagcacgc ttccccggca gcccagctgc 201 gccgacgaca gcggcctcct gcgagggctttgtgtgccca cagcagggct 251 cggactgcga catctgggac gggcagcccg tgtgccgctgccgcgaccgc 301 tgtgagaagg agcccagctt cacctgcgcc tcggacggcc tcacctacta351 caaccgctgc tatatggacg ccgaggcctg cctgcggggc ctgcacctcc 401acatcgtgcc ctgcaagcac gtgctcagct ggccgcccag cagcccgggg 451 ccgccggagaccactgcccg ccccacacct ggggccgcgc ccgtgcctcc 501 tgccctgtac agcagcccctccccacaggc ggtgcaggtt gggggtacgg 551 ccagcctcca ctgcgacgtc agcggccgcccgccgcctgc tgtgacctgg 601 gagaagcaga gtcaccagcg agagaacctg atcatgcgccctgatcagat 651 gtatggcaac gtggtggtca ccagcatcgg gcagctggtg ctctacaacg701 cgcggcccga agacgccggc ctgtacacct gcaccgcgcg caacgctgct 751gggctgctgc gggctgactt cccactctct gtggtccagc gagagccggc 801 cagggacgcagcccccagca tcccagcccc ggccgagtgc ctgccggatg 851 tgcaggcctg cacgggccccacttccccac accttgtcct ctggcactac 901 gacccgcagc ggggcggctg catgaccttcccggcccgtg gctgtgatgg 951 ggcggcccgc ggctttgaga cctacgaggc atgccagcaggcctgtgccc 1001 gcggccccgg cgacgcctgc gtgctgcctg ccgtgcaggg cccctgccgg1051 ggctgggagc cgcgctgggc ctacagcccg ctgctgcagc agtgccatcc 1101cttcgtgtac ggtggctgcg agggcaacgg caacaacttc cacagccgcg 1151 agagctgcgaggatgcctgc cccgtgccgc gcacaccgcc ctgccgcgcc 1201 tgccgcctcc ggagcaagctggcgctgagc ctgtgccgca gcgacttcgc 1251 catcgtgggg cggctcacgg aggtgctggaggagcccgag gccgccggcg 1301 gcatcgcccg cgtggcgctc gaggacgtgc tcaaggatgacaagatgggc 1351 ctcaagttct tgggcaccaa gtacctggag gtgacgctga gtggcatgga1401 ctgggcctgc ccctgcccca acatgacggc gggcgacggg ccgctggtca 1451tcatgggtga ggtgcgcgat ggcgtggccg tgctggacgc cggcagctac 1501 gtccgcgccgccagcgagaa gcgcgtcaag aagatcttgg agctgctgga 1551 gaagcaggcc tgcgagctgctcaaccgctt ccaggac

For example, the following sequence (SEQ ID NO: 55) encodes a naturallyoccurring human WFIKKN2 precursor polypeptide (nucleotides 695-2422 ofNCBI Reference Sequence NM_175575.5). Nucleotides encoding the signalsequence are underlined.

(SEQ ID NO: 55) 1 atgtgggccc caaggtgtcg ccggttctgg tctcgctggg agcaggtggc51 agcgctgctg ctgctgctgc tactgctcgg ggtgcccccg cgaagcctgg 101 cgctgccgcccatccgctat tcccacgccg gcatctgccc caacgacatg 151 aatcccaacc tctgggtggacgcacagagc acctgcaggc gggagtgtga 201 gacggaccag gagtgtgaga cctatgagaagtgctgcccc aacgtatgtg 251 ggaccaagag ctgcgtggcg gcccgctaca tggacgtgaaagggaagaag 301 ggcccagtgg gcatgcccaa ggaggccaca tgtgaccact tcatgtgtct351 gcagcagggc tctgagtgtg acatctggga tggccagccc gtgtgtaagt 401gcaaagaccg ctgtgagaag gagcccagct ttacctgcgc ctcggacggc 451 ctcacctactataaccgctg ctacatggat gccgaggcct gctccaaagg 501 catcacactg gccgttgtaacctgccgcta tcacttcacc tggcccaaca 551 ccagcccccc accacctgag accaccatgcaccccaccac agcctcccca 601 gagacccctg agctggacat ggcggcccct gcgctgctcaacaaccctgt 651 gcaccagtcg gtcaccatgg gtgagacagt gagcttcctc tgtgatgtgg701 tgggccggcc ccggcctgag atcacctggg agaagcagtt ggaggatcgg 751gagaatgtgg tcatgcggcc caaccatgtg cgtggcaacg tggtggtcac 801 caacattgcccagctggtca tctataacgc ccagctgcag gatgctggga 851 tctacacctg cacggcccggaacgtggctg gggtcctgag ggctgatttc 901 ccgctgtcgg tggtcagggg tcatcaggctgcagccacct cagagagcag 951 ccccaatggc acggctttcc cggcggccga gtgcctgaagcccccagaca 1001 gtgaggactg tggcgaagag cagacccgct ggcacttcga tgcccaggcc1051 aacaactgcc tgaccttcac cttcggccac tgccaccgta acctcaacca 1101ctttgagacc tatgaggcct gcatgctggc ctgcatgagc gggccgctgg 1151 ccgcgtgcagcctgcccgcc ctgcaggggc cctgcaaagc ctacgcgcct 1201 cgctgggctt acaacagccagacgggccag tgccagtcct ttgtctatgg 1251 tggctgcgag ggcaatggca acaactttgagagccgtgag gcctgtgagg 1301 agtcgtgccc cttccccagg gggaaccagc gctgtcgggcctgcaagcct 1351 cggcagaagc tcgttaccag cttctgtcgc agcgactttg tcatcctggg1401 ccgagtctct gagctgaccg aggagcctga ctcgggccgc gccctggtga 1451ctgtggatga ggtcctaaag gatgagaaaa tgggcctcaa gttcctgggc 1501 caggagccattggaggtcac tctgcttcac gtggactggg catgcccctg 1551 ccccaacgtg accgtgagcgagatgccgct catcatcatg ggggaggtgg 1601 acggcggcat ggccatgctg cgccccgatagctttgtggg cgcatcgagt 1651 gcccgccggg tcaggaagct tcgtgaggtc atgcacaagaagacctgtga 1701 cgtcctcaag gagtttcttg gcttgcac

The following sequence (SEQ ID NO: 56) encodes a mature human WFIKKN2polypeptide (nucleotides 797-2422 of NCBI Ref Seq NM_175575.5).

(SEQ ID NO: 56) 1 ctgccgccca tccgctattc ccacgccggc atctgcccca acgacatgaa51 tcccaacctc tgggtggacg cacagagcac ctgcaggcgg gagtgtgaga 101 cggaccaggagtgtgagacc tatgagaagt gctgccccaa cgtatgtggg 151 accaagagct gcgtggcggcccgctacatg gacgtgaaag ggaagaaggg 201 cccagtgggc atgcccaagg aggccacatgtgaccacttc atgtgtctgc 251 agcagggctc tgagtgtgac atctgggatg gccagcccgtgtgtaagtgc 301 aaagaccgct gtgagaagga gcccagcttt acctgcgcct cggacggcct351 cacctactat aaccgctgct acatggatgc cgaggcctgc tccaaaggca 401tcacactggc cgttgtaacc tgccgctatc acttcacctg gcccaacacc 451 agccccccaccacctgagac caccatgcac cccaccacag cctccccaga 501 gacccctgag ctggacatggcggcccctgc gctgctcaac aaccctgtgc 551 accagtcggt caccatgggt gagacagtgagcttcctctg tgatgtggtg 601 ggccggcccc ggcctgagat cacctgggag aagcagttggaggatcggga 651 gaatgtggtc atgcggccca accatgtgcg tggcaacgtg gtggtcacca701 acattgccca gctggtcatc tataacgccc agctgcagga tgctgggatc 751tacacctgca cggcccggaa cgtggctggg gtcctgaggg ctgatttccc 801 gctgtcggtggtcaggggtc atcaggctgc agccacctca gagagcagcc 851 ccaatggcac ggctttcccggcggccgagt gcctgaagcc cccagacagt 901 gaggactgtg gcgaagagca gacccgctggcacttcgatg cccaggccaa 951 caactgcctg accttcacct tcggccactg ccaccgtaacctcaaccact 1001 ttgagaccta tgaggcctgc atgctggcct gcatgagcgg gccgctggcc1051 gcgtgcagcc tgcccgccct gcaggggccc tgcaaagcct acgcgcctcg 1101ctgggcttac aacagccaga cgggccagtg ccagtccttt gtctatggtg 1151 gctgcgagggcaatggcaac aactttgaga gccgtgaggc ctgtgaggag 1201 tcgtgcccct tccccagggggaaccagcgc tgtcgggcct gcaagcctcg 1251 gcagaagctc gttaccagct tctgtcgcagcgactttgtc atcctgggcc 1301 gagtctctga gctgaccgag gagcctgact cgggccgcgccctggtgact 1351 gtggatgagg tcctaaagga tgagaaaatg ggcctcaagt tcctgggcca1401 ggagccattg gaggtcactc tgcttcacgt ggactgggca tgcccctgcc 1451ccaacgtgac cgtgagcgag atgccgctca tcatcatggg ggaggtggac 1501 ggcggcatggccatgctgcg ccccgatagc tttgtgggcg catcgagtgc 1551 ccgccgggtc aggaagcttcgtgaggtcat gcacaagaag acctgtgacg 1601 tcctcaagga gtttcttggc ttgcac

In certain aspects, the subject nucleic acids encodingfollistatin-related polypeptides are further understood to includenucleic acids that are variants of SEQ ID NOs: 47-56. Variant nucleotidesequences include sequences that differ by one or more nucleotidesubstitutions, additions or deletions, such as allelic variants; andwill, therefore, include coding sequences that differ from thenucleotide sequence of the coding sequence designated in SEQ ID NOs:47-56.

In certain embodiments, the disclosure provides isolated or recombinantnucleic acid sequences that are at least 80%, 85%, 90%, 95%, 96% 97%,98%, 99%, or 100% identical to SEQ ID NOs: 47-56. One of ordinary skillin the art will appreciate that nucleic acid sequences complementary toSEQ ID NOs: 47-56, and variants of SEQ ID NO: 47-56 are also within thescope of this disclosure. In further embodiments, the nucleic acidsequences of the disclosure can be isolated, recombinant, and/or fusedwith a heterologous nucleotide sequence, or in a DNA library.

In other embodiments, nucleic acids of the invention also includenucleotide sequences that hybridize under highly stringent conditions tothe nucleotide sequence designated in SEQ ID NOs: 47-56, complementsequence of SEQ ID NOs: 47-56, or fragments thereof.

One of ordinary skill in the art will understand readily thatappropriate stringency conditions which promote DNA hybridization can bevaried. For example, one could perform the hybridization at 6.0× sodiumchloride/sodium citrate (SSC) at about 45° C., followed by a wash of2.0×SSC at 50° C. For example, the salt concentration in the wash stepcan be selected from a low stringency of about 2.0×SSC at 50° C. to ahigh stringency of about 0.2×SSC at 50° C. In addition, the temperaturein the wash step can be increased from low stringency conditions at roomtemperature, about 22° C., to high stringency conditions at about 65° C.Both temperature and salt may be varied, or temperature or saltconcentration may be held constant while the other variable is changed.In one embodiment, the invention provides nucleic acids which hybridizeunder low stringency conditions of 6×SSC at room temperature followed bya wash at 2×SSC at room temperature.

Isolated nucleic acids that differ from the nucleic acids as set forthin SEQ ID NOs: 47-56 due to degeneracy in the genetic code are alsowithin the scope of the disclosure. For example, a number of amino acidsare designated by more than one triplet. Codons that specify the sameamino acid, or synonyms (for example, CAU and CAC are synonyms forhistidine) may result in “silent” mutations that do not affect the aminoacid sequence of the protein. However, it is expected that DNA sequencepolymorphisms that do lead to changes in the amino acid sequences of thesubject proteins will exist among mammalian cells. One skilled in theart will appreciate that these variations in one or more nucleotides (upto about 3-5% of the nucleotides) of the nucleic acids encoding aparticular protein may exist among individuals of a given species due tonatural allelic variation. Any and all such nucleotide variations andresulting amino acid polymorphisms are within the scope of thisdisclosure.

In certain embodiments, the recombinant nucleic acids of the disclosuremay be operably linked to one or more regulatory nucleotide sequences inan expression construct. Regulatory nucleotide sequences will generallybe appropriate to the host cell used for expression. Numerous types ofappropriate expression vectors and suitable regulatory sequences areknown in the art for a variety of host cells. Typically, said one ormore regulatory nucleotide sequences may include, but are not limitedto, promoter sequences, leader or signal sequences, ribosomal bindingsites, transcriptional start and termination sequences, translationalstart and termination sequences, and enhancer or activator sequences.Constitutive or inducible promoters as known in the art are contemplatedby the disclosure. The promoters may be either naturally occurringpromoters, or hybrid promoters that combine elements of more than onepromoter. An expression construct may be present in a cell on anepisome, such as a plasmid, or the expression construct may be insertedin a chromosome. In a preferred embodiment, the expression vectorcontains a selectable marker gene to allow the selection of transformedhost cells. Selectable marker genes are well known in the art and willvary with the host cell used.

In certain aspects, the subject nucleic acid is provided in anexpression vector comprising a nucleotide sequence encoding afollistatin-related polypeptide and operably linked to at least oneregulatory sequence. Regulatory sequences are art-recognized and areselected to direct expression of the follistatin-related polypeptide.Accordingly, the term regulatory sequence includes promoters, enhancers,and other expression control elements. Exemplary regulatory sequencesare described in Goeddel; Gene Expression Technology: Methods inEnzymology, Academic Press, San Diego, Calif. (1990). For instance, anyof a wide variety of expression control sequences that control theexpression of a DNA sequence when operatively linked to it may be usedin these vectors to express DNA sequences encoding a follistatin-relatedpolypeptide. Such useful expression control sequences, include, forexample, the early and late promoters of SV40, tet promoter, adenovirusor cytomegalovirus immediate early promoter, RSV promoters, the lacsystem, the trp system, the TAC or TRC system, T7 promoter whoseexpression is directed by T7 RNA polymerase, the major operator andpromoter regions of phage lambda, the control regions for fd coatprotein, the promoter for 3-phosphoglycerate kinase or other glycolyticenzymes, the promoters of acid phosphatase, e.g., Pho5, the promoters ofthe yeast α-mating factors, the polyhedron promoter of the baculovirussystem and other sequences known to control the expression of genes ofprokaryotic or eukaryotic cells or their viruses, and variouscombinations thereof. It should be understood that the design of theexpression vector may depend on such factors as the choice of the hostcell to be transformed and/or the type of protein desired to beexpressed. Moreover, the vector's copy number, the ability to controlthat copy number and the expression of any other protein encoded by thevector, such as antibiotic markers, should also be considered.

A recombinant nucleic acid of the disclosure can be produced by ligatingthe cloned gene, or a portion thereof, into a vector suitable forexpression in either prokaryotic cells, eukaryotic cells (yeast, avian,insect or mammalian), or both. Expression vehicles for production of arecombinant follistatin-related polypeptide include plasmids and othervectors. For instance, suitable vectors include plasmids of the types:pBR322-derived plasmids, pEMBL-derived plasmids, pEX-derived plasmids,pBTac-derived plasmids and pUC-derived plasmids for expression inprokaryotic cells, such as E. coli.

Some mammalian expression vectors contain both prokaryotic sequences tofacilitate the propagation of the vector in bacteria, and one or moreeukaryotic transcription units that are expressed in eukaryotic cells.The pcDNAI/amp, pcDNAI/neo, pRc/CMV, pSV2gpt, pSV2neo, pSV2-dhfr, pTk2,pRSVneo, pMSG, pSVT7, pko-neo and pHyg derived vectors are examples ofmammalian expression vectors suitable for transfection of eukaryoticcells. Some of these vectors are modified with sequences from bacterialplasmids, such as pBR322, to facilitate replication and drug resistanceselection in both prokaryotic and eukaryotic cells. Alternatively,derivatives of viruses such as the bovine papilloma virus (BPV-1), orEpstein-Barr virus (pHEBo, pREP-derived and p205) can be used fortransient expression of proteins in eukaryotic cells. Examples of otherviral (including retroviral) expression systems can be found below inthe description of gene therapy delivery systems. The various methodsemployed in the preparation of the plasmids and in transformation ofhost organisms are well known in the art. For other suitable expressionsystems for both prokaryotic and eukaryotic cells, as well as generalrecombinant procedures, see Molecular Cloning A Laboratory Manual, 2ndEd., ed. by Sambrook, Fritsch and Maniatis (Cold Spring HarborLaboratory Press, 1989) Chapters 16 and 17. In some instances, it may bedesirable to express the recombinant polypeptides by the use of abaculovirus expression system. Examples of such baculovirus expressionsystems include pVL-derived vectors (such as pVL1392, pVL1393 andpVL941), pAcUW-derived vectors (such as pAcUW1), and pBlueBac-derivedvectors (such as the β-gal containing pBlueBac III).

In certain embodiments, a vector will be designed for production of thesubject follistatin-related polypeptides in CHO cells, such as aPcmv-Script vector (Stratagene, La Jolla, Calif), pcDNA4 vectors(Invitrogen, Carlsbad, Calif.) and pCI-neo vectors (Promega, Madison,Wisc.). As will be apparent, the subject gene constructs can be used tocause expression of the subject follistatin-related polypeptides incells propagated in culture, e.g., to produce proteins, including fusionproteins or variant proteins, for purification.

This disclosure also pertains to a host cell transfected with arecombinant gene including a coding sequence (e.g., SEQ ID NOs: 19-22)for one or more of the subject follistatin-related polypeptides. Thehost cell may be any prokaryotic or eukaryotic cell. For example, afollistatin-related polypeptide of the disclosure may be expressed inbacterial cells such as E. coli, insect cells (e.g., using a baculovirusexpression system), yeast, or mammalian cells. Other suitable host cellsare known to those skilled in the art.

Accordingly, the present disclosure further pertains to methods ofproducing the subject follistatin-related polypeptides. For example, ahost cell transfected with an expression vector encoding afollistatin-related polypeptide can be cultured under appropriateconditions to allow expression of the follistatin-related polypeptide tooccur. The follistatin-related polypeptide may be secreted and isolatedfrom a mixture of cells and medium containing the follistatin-relatedpolypeptide. Alternatively, the follistatin-related polypeptide may beretained cytoplasmically or in a membrane fraction and the cellsharvested, lysed and the protein isolated. A cell culture includes hostcells, media and other byproducts. Suitable media for cell culture arewell known in the art. The subject follistatin-related polypeptides canbe isolated from cell culture medium, host cells, or both, usingtechniques known in the art for purifying proteins, includingion-exchange chromatography, gel filtration chromatography,ultrafiltration, electrophoresis, and immunoaffinity purification withantibodies specific for particular epitopes of the follistatinpolypeptides. In a preferred embodiment, the follistatin-relatedpolypeptide is a fusion protein containing a domain that facilitates itspurification.

In another embodiment, a fusion gene coding for a purification leadersequence, such as a poly-(His)/enterokinase cleavage site sequence atthe N-terminus of the desired portion of the recombinantfollistatin-related polypeptide, can allow purification of the expressedfusion protein by affinity chromatography using a Ni²⁺ metal resin. Thepurification leader sequence can then be subsequently removed bytreatment with enterokinase to provide the purified follistatin-relatedpolypeptide (e.g., see Hochuli et al., (1987) J. Chromatography 411:177;and Janknecht et al., PNAS USA 88:8972).

Techniques for making fusion genes are well known. Essentially, thejoining of various DNA fragments coding for different polypeptidesequences is performed in accordance with conventional techniques,employing blunt-ended or stagger-ended termini for ligation, restrictionenzyme digestion to provide for appropriate termini, filling-in ofcohesive ends as appropriate, alkaline phosphatase treatment to avoidundesirable joining, and enzymatic ligation. In another embodiment, thefusion gene can be synthesized by conventional techniques includingautomated DNA synthesizers. Alternatively, PCR amplification of genefragments can be carried out using anchor primers that give rise tocomplementary overhangs between two consecutive gene fragments that cansubsequently be annealed to generate a chimeric gene sequence (see, forexample, Current Protocols in Molecular Biology, eds. Ausubel et al.,John Wiley & Sons: 1992).

4. Exemplary Therapeutic Uses

In certain embodiments, compositions of the present disclosure,including for example various protein complexes comprisingfollistatin-related fusion polypeptides disclosed herein, can be usedfor treating or preventing a disease or condition that is described inthis section, including diseases or disorders that are associated withabnormal activity of a follistatin-related polypeptide and/or afollistatin ligand (e.g., myostatin, activins, GDF11). These diseases,disorders or conditions are generally referred to herein as“follistatin-associated conditions.” In certain embodiments, the presentdisclosure provides methods of treating or preventing an individual inneed thereof through administering to the individual a therapeuticallyeffective amount of a follistatin-related fusion polypeptide asdescribed above. These methods are particularly aimed at therapeutic andprophylactic treatments of animals, and more particularly, humans.

As used herein, a therapeutic that “prevents” a disorder or conditionrefers to a compound that, in a statistical sample, reduces theoccurrence of the disorder or condition in the treated sample relativeto an untreated control sample, or delays the onset or reduces theseverity of one or more symptoms of the disorder or condition relativeto the untreated control sample. The term “treating” as used hereinincludes amelioration or elimination of the condition once it has beenestablished. In either case, prevention or treatment may be discerned inthe diagnosis provided by a physician or other health care provider andthe intended result of administration of the therapeutic agent.

In general, treatment or prevention of a disease or condition asdescribed in the present disclosure is achieved by administering afollistatin-related polypeptide, or compositions, complexes orcombinations comprising such polypeptide, of the present disclosure inan “effective amount”. An effective amount of an agent refers to anamount effective, at dosages and for periods of time necessary, toachieve the desired therapeutic or prophylactic result. A“therapeutically effective amount” of an agent of the present disclosuremay vary according to factors such as the disease state, age, sex, andweight of the individual, and the ability of the agent to elicit adesired response in the individual. A “prophylactically effectiveamount” refers to an amount effective, at dosages and for periods oftime necessary, to achieve the desired prophylactic result.

Follistatin-ligand complexes play essential roles in tissue growth aswell as early developmental processes such as the correct formation ofvarious structures or in one or more post-developmental capacitiesincluding sexual development, pituitary hormone production, and creationof muscle. Thus, follistatin-associated conditions include abnormaltissue growth and developmental defects.

Exemplary conditions for treatment include neuromuscular disorders(e.g., muscular dystrophy and muscle atrophy), congestive obstructivepulmonary disease (and muscle wasting associated with COPD), musclewasting syndrome, sarcopenia, and cachexia, adipose tissue disorders(e.g., obesity), type 2 diabetes (NIDDM, adult-onset diabetes), and bonedegenerative disease (e.g., osteoporosis). Other exemplary conditionsinclude musculodegenerative and neuromuscular disorders, tissue repair(e.g., wound healing), and neurodegenerative diseases (e.g., amyotrophiclateral sclerosis).

In certain embodiments, compositions (e.g., follistatin-related fusionproteins) of the invention are used as part of a treatment for amuscular dystrophy. The term “muscular dystrophy” refers to a group ofdegenerative muscle diseases characterized by gradual weakening anddeterioration of skeletal muscles and sometimes the heart andrespiratory muscles. Muscular dystrophies are genetic disorderscharacterized by progressive muscle wasting and weakness that begin withmicroscopic changes in the muscle. As muscles degenerate over time, theperson's muscle strength declines. Exemplary muscular dystrophies thatcan be treated with a regimen including the subject follistatin-relatedpolypeptides include: Duchenne muscular dystrophy (DMD), Becker musculardystrophy (BMD), Emery-Dreifuss muscular dystrophy (EDMD), limb-girdlemuscular dystrophy (LGMD), facioscapulohumeral muscular dystrophy (FSHor FSHD) (also known as Landouzy-Dejerine), myotonic mystrophy (MMD)(also known as Steinert's Disease), oculopharyngeal muscular dystrophy(OPMD), distal muscular dystrophy (DD), congenital muscular dystrophy(CMD).

Duchenne muscular dystrophy (DMD) was first described by the Frenchneurologist Guillaume Benjamin Amand Duchenne in the 1860s. Beckermuscular dystrophy (BMD) is named after the German doctor Peter EmilBecker, who first described this variant of DMD in the 1950s. DMD is oneof the most frequent inherited diseases in males, affecting one in 3,500boys. DMD occurs when the dystrophin gene, located on the short arm ofthe X chromosome, is broken. Since males only carry one copy of the Xchromosome, they only have one copy of the dystrophin gene. Without thedystrophin protein, muscle is easily damaged during cycles ofcontraction and relaxation. While early in the disease musclecompensates by regeneration, later on muscle progenitor cells cannotkeep up with the ongoing damage and healthy muscle is replaced bynon-functional fibro-fatty tissue.

BMD results from different mutations in the dystrophin gene. BMDpatients have some dystrophin, but it is either insufficient in quantityor poor in quality. Having some dystrophin protects the muscles of thosewith BMD from degenerating as badly or as quickly as those of peoplewith DMD.

For example, recent researches demonstrate that blocking or eliminatingfunction of myostatin (a follistatin ligand) in vivo can effectivelytreat at least certain symptoms in DMD and BMD patients. Thus, thesubject follistatin-related fusion polypeptides may act as myostatininhibitors (antagonists), and constitute an alternative means ofblocking the functions of myostatin in vivo in DMD and BMD patients.

Similarly, the subject follistatin-related fusion polypeptides providean effective means to increase muscle mass in other disease conditionsthat are in need of muscle growth. For example, amyotrophic lateralsclerosis (ALS, also called Lou Gehrig's disease or motor neurondisease) is a chronic, incurable, and unstoppable CNS disorder thatattacks the motor neurons, components of the CNS that connect the brainto the skeletal muscles. In ALS, the motor neurons deteriorate andeventually die, and though a person's brain normally remains fullyfunctioning and alert, the command to move never reaches the muscles.Most people who get ALS are between 40 and 70 years old. The first motorneurons that weaken are those leading to the arms or legs. Those withALS may have trouble walking, they may drop things, fall, slur theirspeech, and laugh or cry uncontrollably. Eventually the muscles in thelimbs begin to atrophy from disuse. This muscle weakness will becomedebilitating and a person will need a wheel chair or become unable tofunction out of bed. Most ALS patients die from respiratory failure orfrom complications of ventilator assistance like pneumonia, 3-5 yearsfrom disease onset.

Increased muscle mass induced by follistatin-related fusion polypeptidesmight also benefit those suffering from muscle wasting diseases.Myostatin expression correlates inversely with fat-free mass in humansand that increased expression of the MSTN gene is associated with weightloss in men with AIDS wasting syndrome. By inhibiting the function ofmyostatin in AIDS patients, at least certain symptoms of AIDS may bealleviated, if not completely eliminated, thus significantly improvingquality of life in AIDS patients.

Cancer anorexia-cachexia syndrome is among the most debilitating andlife-threatening aspects of cancer. This syndrome is a common feature ofmany types of cancer present in approximately 80% of cancer patients atdeath and is responsible not only for a poor quality of life and poorresponse to chemotherapy but also a shorter survival time than is foundin patients with comparable tumors but without weight loss. Cachexia istypically suspected in patients with cancer if an involuntary weightloss of greater than five percent of premorbid weight occurs within asix-month period. Associated with anorexia, wasting of fat and muscletissue, and psychological distress, cachexia arises from a complexinteraction between the cancer and the host. Cancer cachexia affectscytokine production, release of lipid-mobilizing andproteolysis-inducing factors, and alterations in intermediarymetabolism. Although anorexia is common, a decreased food intake aloneis unable to account for the changes in body composition seen in cancerpatients, and increasing nutrient intake is unable to reverse thewasting syndrome. Currently, there is no treatment to control or reversethe cachexic process. Since systemic overexpression of GDF8 in adultmice was found to induce profound muscle and fat loss analogous to thatseen in human cachexia syndromes (Zimmers et al., supra), the subjectfollistatin-related polypeptides may be beneficially used to prevent,treat, or alleviate the symptoms of the cachexia syndrome, where musclegrowth is desired.

In other embodiments, follistatin-related fusion polypeptides, orcombinations of such polypeptides, can be used for regulating body fatcontent in an animal and for treating or preventing conditions relatedthereto, and particularly, health-compromising conditions relatedthereto. According to the present invention, to regulate (control) bodyweight can refer to reducing or increasing body weight, reducing orincreasing the rate of weight gain, or increasing or reducing the rateof weight loss, and also includes actively maintaining, or notsignificantly changing body weight (e.g., against external or internalinfluences which may otherwise increase or decrease body weight). Oneembodiment of the present disclosure relates to regulating body weightby administering to an animal (e.g., a human) in need thereof afollistatin-related fusion polypeptides, or combinations of suchpolypeptides of the disclosure.

In some embodiments, follistatin-related fusion polypeptides, orcombinations of such polypeptides, of the present disclosure can be usedfor reducing body weight and/or reducing weight gain in an animal, andmore particularly, for treating or ameliorating obesity in patients atrisk for or suffering from obesity. In another specific embodiment, thepresent invention is directed to methods and compounds for treating ananimal that is unable to gain or retain weight (e.g., an animal with awasting syndrome). Such methods are effective to increase body weightand/or mass, or to reduce weight and/or mass loss, or to improveconditions associated with or caused by undesirably low (e.g.,unhealthy) body weight and/or mass. In addition, disorders of highcholesterol (e.g., hypercholesterolemia or dislipidemia) may be treatedwith an follistatin-related fusion polypeptides, or combinations of suchpolypeptides, of the disclosure.

In some embodiments, follistatin-related fusion polypeptides, orcombinations of such polypeptides, of the present disclosure can be usedfor treating a metabolic disorder such as type II diabetes, metabolicsyndrome, hyperadinectonemia, hyperglycemia or hyperinsulinemia.

Fibrosis generally refers to an excessive deposition of both collagenfibers and extracellular matrix combined with a relative decrease ofcell number in an organ or tissue. While this process is an importantfeature of natural wound healing following injury, fibrosis can lead topathological damage in various tissue and organs including, for example,the lungs, kidneys, liver, bone, muscle, and skin. The role the TGF-betasuperfamily in fibrosis has been extensively study. TGF-beta superfamilyligands have been implicated in fibrosis including, for example,activins (e.g., activin A and activin B) and GDF8 [Hedger et al (2013)Cytokine and Growth Factor Reviews 24:285-295; Hardy et al. (2015) 93:567-574; and Cantini et al. (2008) J Sex Med 5:1607-1622]. Therefore, insome embodiments, follistatin-related fusion polypeptides, orcombinations of such polypeptides, of the present disclosure can be usedto treat fibrosis, particularly fibrosis-associated disorders andconditions. For example, follistatin-related fusion polypeptides, orcombinations of such polypeptides, may be used to treat or prevent oneor more of: pulmonary fibrosis, hypersensitivity pneumonitis, idiopathicfibrosis, tuberculosis, pneumonia, cystic fibrosis, asthma, chronicobstructive pulmonary disease (COPD), emphysema, renal (kidney)fibrosis, renal (kidney) failure, chronic renal (kidney) disease, bonefibrosis, myelofibrosis, rheumatoid arthritis, systemic lupuserythematosus, scleroderma, sarcoidosis, granulomatosis withpolyangiitis, Peyronie's disease, liver fibrosis, Wilson's disease,glycogen storage diseases (particularly types III, IV, IX, and X),iron-overload, Gaucher disease, Zellweger syndrome, nonalcoholic andalcoholic steatohepatitis, biliary cirrhosis, sclerosing cholangitis,Budd-Chiari syndrome, surgery-associated fibrosis, Crohn's disease,Duputren's contracture, mediastinal fibrosis, nephrogeneic fibrosis,retroperitoneal fibrosis, atrial fibrosis, endomyocardial fibrosis,pancreatic fibrosis.

In some embodiments, follistatin-related fusion polypeptides, orcombinations of such polypeptides, of the present disclosure may be usedto treat or prevent chronic kidney disease, optionally in combinationwith one or more supportive therapies for treating chronic kidneydisease. In some embodiments, follistatin-related fusion polypeptides,or combinations of such polypeptides, of the present disclosure may beused to treat or prevent one or more complications (symptoms ormanifestations) of chronic kidney disease, optionally in combinationwith one or more supportive therapies for treating chronic kidneydisease. In some embodiments, follistatin-related fusion polypeptides,or combinations of such polypeptides, of the present disclosure may beused to treat or prevent end-stage kidney failure, optionally incombination with one or more supportive therapies for treating end-stagekidney disease. Chronic kidney disease (CKD), also known as chronicrenal disease, is a progressive loss in renal function over a period ofmonths or years. The symptoms of worsening kidney function may includefeeling generally unwell and experiencing a reduced appetite. Often,chronic kidney disease is diagnosed as a result of screening of peopleknown to be at risk of kidney problems, such as those with high bloodpressure or diabetes and those with a blood relative with CKD. Thisdisease may also be identified when it leads to one of its recognizedcomplications, such as cardiovascular disease, anemia, or pericarditis.Recent professional guidelines classify the severity of CKD in fivestages, with stage 1 being the mildest and usually causing few symptomsand stage 5 being a severe illness with poor life expectancy ifuntreated. Stage 5 CKD is often called end-stage kidney disease,end-stage renal disease, or end-stage kidney failure, and is largelysynonymous with the now outdated terms chronic renal failure or chronickidney failure; and usually means the patient requires renal replacementtherapy, which may involve a form of dialysis, but ideally constitutes akidney transplant. CKD is initially without specific symptoms and isgenerally only detected as an increase in serum creatinine or protein inthe urine. As the kidney function decreases and various symptoms maymanifest as described below. Blood pressure may be increased due tofluid overload and production of vasoactive hormones created by thekidney via the renin-angiotensin system, increasing one's risk ofdeveloping hypertension and/or suffering from congestive heart failure.Urea may accumulate, leading to azotemia and ultimately uremia (symptomsranging from lethargy to pericarditis and encephalopathy). Due to itshigh systemic circulation, urea is excreted in eccrine sweat at highconcentrations and crystallizes on skin as the sweat evaporates (“uremicfrost”). Potassium may accumulate in the blood (hyperkalemia with arange of symptoms including malaise and potentially fatal cardiacarrhythmias). Hyperkalemia usually does not develop until the glomerularfiltration rate falls to less than 20-25 ml/min/1.73 m2, at which pointthe kidneys have decreased ability to excrete potassium. Hyperkalemia inCKD can be exacerbated by acidemia (which leads to extracellular shiftof potassium) and from lack of insulin. Erythropoietin synthesis may bedecreased causing anemia. Fluid volume overload symptoms may occur,ranging from mild edema to life-threatening pulmonary edema.Hyperphosphatemia, due to reduced phosphate excretion, may occurgenerally following the decrease in glomerular filtration.Hyperphosphatemia is associated with increased cardiovascular risk,being a direct stimulus to vascular calcification. Hypocalcemia maymanifest, which is generally caused by stimulation of fibroblast growthfactor-23. Osteocytes are responsible for the increased production ofFGF23, which is a potent inhibitor of the enzyme 1-alpha-hydroxylase(responsible for the conversion of 25-hydroxycholecalciferol into 1,25dihydroxyvitamin D3). Later, this progresses to secondaryhyperparathyroidism, renal osteodystrophy, and vascular calcificationthat further impairs cardiac function. Metabolic acidosis (due toaccumulation of sulfates, phosphates, uric acid etc.) may occur andcause altered enzyme activity by excess acid acting on enzymes; and alsoincreased excitability of cardiac and neuronal membranes by thepromotion of hyperkalemia due to excess acid (acidemia). Acidosis isalso due to decreased capacity to generate enough ammonia from the cellsof the proximal tubule. Iron deficiency anemia, which increases inprevalence as kidney function decreases, is especially prevalent inthose requiring haemodialysis. It is multifactoral in cause, butincludes increased inflammation, reduction in erythropoietin, andhyperuricemia leading to bone marrow suppression. People with CKD sufferfrom accelerated atherosclerosis and are more likely to developcardiovascular disease than the general population. Patients afflictedwith CKD and cardiovascular disease tend to have significantly worseprognoses than those suffering only from the latter.

As used herein, “in combination with”, “combinations of”, or “conjointadministration” refers to any form of administration such thatadditional therapies (e.g., second, third, fourth, etc.) are stilleffective in the body (e.g., multiple compounds are simultaneouslyeffective in the patient, which may include synergistic effects of thosecompounds). Effectiveness may not correlate to measurable concentrationof the agent in blood, serum, or plasma. For example, the differenttherapeutic compounds can be administered either in the same formulationor in separate formulations, either concomitantly or sequentially, andon different schedules. Thus, an individual who receives such treatmentcan benefit from a combined effect of different therapies. One or morefollistatin-related fusion polypeptides, or combinations of suchpolypeptides of the disclosure can be administered concurrently with,prior to, or subsequent to, one or more other additional agents orsupportive therapies. In general, each therapeutic agent will beadministered at a dose and/or on a time schedule determined for thatparticular agent. The particular combination to employ in a regimen willtake into account compatibility of the antagonist of the presentdisclosure with the therapy and/or the desired.

5. Pharmaceutical Compositions

In certain embodiments, compounds (e.g., follistatin-relatedpolypeptides) of the present invention are formulated with apharmaceutically acceptable carrier. For example, a follistatin-relatedpolypeptide can be administered alone or as a component of apharmaceutical formulation (i.e., a therapeutic composition). Thesubject compounds may be formulated for administration in any convenientway for use in human or veterinary medicine.

In certain embodiments, the therapeutic method of the invention includesadministering the composition topically, systemically, locally, orlocally as an implant or device. When administered, the therapeuticcomposition for use in this invention is, of course, in a pyrogen-free,physiologically acceptable form. Further, the composition may desirablybe encapsulated or injected in a viscous form for delivery to a targettissue site (e.g., bone, cartilage, muscle, fat or neurons), forexample, a site having tissue damage. Topical administration may besuitable for wound healing and tissue repair. Therapeutically usefulagents other than the follistatin-related polypeptides, which may alsooptionally be included in the composition as described above, mayalternatively or additionally, be administered simultaneously orsequentially with the subject compounds (e.g., follistatin-relatedpolypeptides) in the methods of the invention.

In certain embodiments, compositions of the present invention mayinclude a matrix capable of delivering one or more therapeutic compounds(e.g., follistatin-related polypeptides) to a target tissue site,providing a structure for the developing tissue and optimally capable ofbeing resorbed into the body. For example, the matrix may provide slowrelease of the follistatin-related polypeptides. Such matrices may beformed of materials presently in use for other implanted medicalapplications.

The choice of matrix material is based on biocompatibility,biodegradability, mechanical properties, cosmetic appearance andinterface properties. The particular application of the subjectcompositions will define the appropriate formulation. Potential matricesfor the compositions may be biodegradable and chemically defined calciumsulfate, tricalciumphosphate, hydroxyapatite, polylactic acid andpolyanhydrides. Other potential materials are biodegradable andbiologically well defined, such as bone or dermal collagen. Furthermatrices are comprised of pure proteins or extracellular matrixcomponents. Other potential matrices are non-biodegradable andchemically defined, such as sintered hydroxyapatite, bioglass,aluminates, or other ceramics. Matrices may be comprised of combinationsof any of the above-mentioned types of material, such as polylactic acidand hydroxyapatite or collagen and tricalciumphosphate. The bioceramicsmay be altered in composition, such as in calcium-aluminate-phosphateand processing to alter pore size, particle size, particle shape, andbiodegradability.

In certain embodiments, methods of the invention can be administered fororally, e.g., in the form of capsules, cachets, pills, tablets, lozenges(using a flavored basis, usually sucrose and acacia or tragacanth),powders, granules, or as a solution or a suspension in an aqueous ornon-aqueous liquid, or as an oil-in-water or water-in-oil liquidemulsion, or as an elixir or syrup, or as pastilles (using an inertbase, such as gelatin and glycerin, or sucrose and acacia) and/or asmouth washes and the like, each containing a predetermined amount of anagent as an active ingredient. An agent may also be administered as abolus, electuary or paste.

In solid dosage forms for oral administration (capsules, tablets, pills,dragees, powders, granules, and the like), one or more therapeuticcompounds of the present invention may be mixed with one or morepharmaceutically acceptable carriers, such as sodium citrate ordicalcium phosphate, and/or any of the following: (1) fillers orextenders, such as starches, lactose, sucrose, glucose, mannitol, and/orsilicic acid; (2) binders, such as, for example, carboxymethylcellulose,alginates, gelatin, polyvinyl pyrrolidone, sucrose, and/or acacia; (3)humectants, such as glycerol; (4) disintegrating agents, such asagar-agar, calcium carbonate, potato or tapioca starch, alginic acid,certain silicates, and sodium carbonate; (5) solution retarding agents,such as paraffin; (6) absorption accelerators, such as quaternaryammonium compounds; (7) wetting agents, such as cetyl alcohol andglycerol monostearate; (8) absorbents, such as kaolin and bentoniteclay; (9) lubricants, such a talc, calcium stearate, magnesium stearate,solid polyethylene glycols, sodium lauryl sulfate, and mixtures thereof;and (10) coloring agents. In the case of capsules, tablets and pills,the pharmaceutical compositions may also comprise buffering agents.Solid compositions of a similar type may also be employed as fillers insoft and hard-filled gelatin capsules using such excipients as lactoseor milk sugars, as well as high molecular weight polyethylene glycols.

Liquid dosage forms for oral administration include pharmaceuticallyacceptable emulsions, microemulsions, solutions, suspensions, syrups,and elixirs. In addition to the active ingredient, the liquid dosageforms may contain inert diluents commonly used in the art, such as wateror other solvents, solubilizing agents and emulsifiers, such as ethylalcohol, isopropyl alcohol, ethyl carbonate, ethyl acetate, benzylalcohol, benzyl benzoate, propylene glycol, 1,3-butylene glycol, oils(in particular, cottonseed, groundnut, corn, germ, olive, castor, andsesame oils), glycerol, tetrahydrofuryl alcohol, polyethylene glycolsand fatty acid esters of sorbitan, and mixtures thereof. Besides inertdiluents, the oral compositions can also include adjuvants such aswetting agents, emulsifying and suspending agents, sweetening,flavoring, coloring, perfuming, and preservative agents.

Suspensions, in addition to the active compounds, may contain suspendingagents such as ethoxylated isostearyl alcohols, polyoxyethylenesorbitol, and sorbitan esters, microcrystalline cellulose, aluminummetahydroxide, bentonite, agar-agar and tragacanth, and mixturesthereof.

Certain compositions disclosed herein may be administered topically,either to skin or to mucosal membranes. The topical formulations mayfurther include one or more of the wide variety of agents known to beeffective as skin or stratum corneum penetration enhancers. Examples ofthese are 2-pyrrolidone, N-methyl-2-pyrrolidone, dimethylacetamide,dimethylformamide, propylene glycol, methyl or isopropyl alcohol,dimethyl sulfoxide, and azone. Additional agents may further be includedto make the formulation cosmetically acceptable. Examples of these arefats, waxes, oils, dyes, fragrances, preservatives, stabilizers, andsurface active agents. Keratolytic agents such as those known in the artmay also be included. Examples are salicylic acid and sulfur.

Dosage forms for the topical or transdermal administration includepowders, sprays, ointments, pastes, creams, lotions, gels, solutions,patches, and inhalants. The active compound may be mixed under sterileconditions with a pharmaceutically acceptable carrier, and with anypreservatives, buffers, or propellants that may be required. Theointments, pastes, creams and gels may contain, in addition to a subjectcompound of the invention (e.g., a follistatin-related polypeptide),excipients, such as animal and vegetable fats, oils, waxes, paraffins,starch, tragacanth, cellulose derivatives, polyethylene glycols,silicones, bentonites, silicic acid, talc and zinc oxide, or mixturesthereof.

Powders and sprays can contain, in addition to a subject compound,excipients such as lactose, talc, silicic acid, aluminum hydroxide,calcium silicates, and polyamide powder, or mixtures of thesesubstances. Sprays can additionally contain customary propellants, suchas chlorofluorohydrocarbons and volatile unsubstituted hydrocarbons,such as butane and propane.

In certain embodiments, pharmaceutical compositions suitable forparenteral administration may comprise one or more follistatin-relatedpolypeptides in combination with one or more pharmaceutically acceptablesterile isotonic aqueous or nonaqueous solutions, dispersions,suspensions or emulsions, or sterile powders which may be reconstitutedinto sterile injectable solutions or dispersions just prior to use,which may contain antioxidants, buffers, bacteriostats, solutes whichrender the formulation isotonic with the blood of the intended recipientor suspending or thickening agents. Examples of suitable aqueous andnonaqueous carriers that may be employed in the pharmaceuticalcompositions of the invention include water, ethanol, polyols (such asglycerol, propylene glycol, polyethylene glycol, and the like), andsuitable mixtures thereof, vegetable oils, such as olive oil, andinjectable organic esters, such as ethyl oleate. Proper fluidity can bemaintained, for example, by the use of coating materials, such aslecithin, by the maintenance of the required particle size in the caseof dispersions, and by the use of surfactants.

The compositions of the invention may also contain adjuvants, such aspreservatives, wetting agents, emulsifying agents and dispersing agents.Prevention of the action of microorganisms may be ensured by theinclusion of various antibacterial and antifungal agents, for example,paraben, chlorobutanol, phenol sorbic acid, and the like. It may also bedesirable to include isotonic agents, such as sugars, sodium chloride,and the like into the compositions. In addition, prolonged absorption ofthe injectable pharmaceutical form may be brought about by the inclusionof agents that delay absorption, such as aluminum monostearate andgelatin.

It is understood that the dosage regimen will be determined by theattending physician, considering various factors that modify the actionof the subject compounds of the invention (e.g., follistatin-relatedpolypeptides). The various factors will depend upon the disease to betreated.

In certain embodiments, the present invention also provides gene therapyfor the in vivo production of follistatin-related polypeptides or othercompounds disclosed herein. Such therapy would achieve its therapeuticeffect by introduction of the follistatin-related polynucleotidesequences into cells or tissues having the disorders as listed above.Delivery of follistatin-related polynucleotide sequences can be achievedusing a recombinant expression vector such as a chimeric virus or acolloidal dispersion system. Preferred for therapeutic delivery offollistatin-related polynucleotide sequences is the use of targetedliposomes.

Various viral vectors which can be utilized for gene therapy as taughtherein include adenovirus, herpes virus, vaccinia, or, preferably, anRNA virus such as a retrovirus. Preferably, the retroviral vector is aderivative of a murine or avian retrovirus. Examples of retroviralvectors in which a single foreign gene can be inserted include, but arenot limited to: Moloney murine leukemia virus (MoMuLV), Harvey murinesarcoma virus (HaMuSV), murine mammary tumor virus (MuMTV), and RousSarcoma Virus (RSV). A number of additional retroviral vectors canincorporate multiple genes. All of these vectors can transfer orincorporate a gene for a selectable marker so that transduced cells canbe identified and generated. Retroviral vectors can be madetarget-specific by attaching, for example, a sugar, a glycolipid, or aprotein. Preferred targeting is accomplished by using an antibody. Thoseof skill in the art will recognize that specific polynucleotidesequences can be inserted into the retroviral genome or attached to aviral envelope to allow target specific delivery of the retroviralvector containing the follistatin-related polynucleotide. In onepreferred embodiment, the vector is targeted to bone, cartilage, muscleor neuron cells/tissues.

Alternatively, tissue culture cells can be directly transfected withplasmids encoding the retroviral structural genes gag, pol and env, byconventional calcium phosphate transfection. These cells are thentransfected with the vector plasmid containing the genes of interest.The resulting cells release the retroviral vector into the culturemedium.

Another targeted delivery system for follistatin-related polynucleotidesis a colloidal dispersion system. Colloidal dispersion systems includemacromolecule complexes, nanocapsules, microspheres, beads, andlipid-based systems including oil-in-water emulsions, micelles, mixedmicelles, and liposomes. The preferred colloidal system of thisinvention is a liposome. Liposomes are artificial membrane vesicleswhich are useful as delivery vehicles in vitro and in vivo. RNA, DNA andintact virions can be encapsulated within the aqueous interior and bedelivered to cells in a biologically active form (see e.g., Fraley, etal., Trends Biochem. Sci., 6:77, 1981). Methods for efficient genetransfer using a liposome vehicle, are known in the art, see e.g.,Mannino, et al., Biotechniques, 6:682, 1988. The composition of theliposome is usually a combination of phospholipids, usually incombination with steroids, especially cholesterol. Other phospholipidsor other lipids may also be used. The physical characteristics ofliposomes depend on pH, ionic strength, and the presence of divalentcations.

Examples of lipids useful in liposome production include phosphatidylcompounds, such as phosphatidylglycerol, phosphatidylcholine,phosphatidylserine, phosphatidylethanolamine, sphingolipids,cerebrosides, and gangliosides. Illustrative phospholipids include eggphosphatidylcholine, dipalmitoylphosphatidylcholine, and distearoylphosphatidylcholine. The targeting of liposomes is also possiblebased on, for example, organ-specificity, cell-specificity, andorganelle-specificity and is known in the art.

EXEMPLIFICATION

The invention now being generally described, it will be more readilyunderstood by reference to the following examples, which are includedmerely for purposes of illustration of certain embodiments andembodiments of the present invention, and are not intended to limit theinvention.

Example 1 Generation of a Single-Arm WFIKKN2 Polypeptide Fusion Protein

Applicants generated a soluble asymmetric Fc fusion protein in whichnative full-length human WFIKKN2 polypeptide was attached through alinker to one of two human G1Fc chains.

A methodology for promoting formation of WFIKKN2-Fc heteromericcomplexes, as opposed to WFIKKN2-Fc homodimeric complexes, is tointroduce alterations in the amino acid sequence of the Fc domains toguide the formation of asymmetric heteromeric complexes. Many differentapproaches to making asymmetric interaction pairs using Fc domains aredescribed in this disclosure.

In one approach, illustrated in the polypeptide sequences of SEQ ID NOs:57 and 58, one Fc domain is altered to introduce cationic amino acids atthe interaction face, while the other Fc domain is altered to introduceanionic amino acids at the interaction face. In this example, correctpairing of the two polypeptide chains is promoted through a charge-basedmechanism by substituting lysine residues at two positions (underlined)in WFIKKN2-G1Fc(E134K/D177K) (SEQ ID NO: 57) and aspartate residues attwo positions (underlined) in G1Fc(K170D/K187D) (SEQ ID NO: 58). Anoptional N-terminal extension of 13 amino acids (underlined) is includedon the short chain (SEQ ID NO: 58) to facilitate disulfide bondformation by the cysteine at position 4.

(SEQ ID NO: 57)   1 LPPIRYSHAG ICPNDMNPNL WVDAQSTCRR ECETDQECETYEKCCPNVCG  51 TKSCVAARYM DVKGKKGPVG MPKEATCDHF MCLQQGSECD IWDGQPVCKC101 KDRCEKEPSF TCASDGLTYY NRCYMDAEAC SKGITLAVVT CRYHFTWPNT 151SPPPPETTMH PTTASPETPE LDMAAPALLN NPVHQSVTMG ETVSFLCDVV 201 GRPRPEITWEKQLEDRENVV MRPNHVRGNV VVTNIAQLVI YNAQLQDAGI 251 YTCTARNVAG VLRADFPLSVVRGHQAAATS ESSPNGTAFP AAECLKPPDS 301 EDCGEEQTRW HFDAQANNCL TFTFGHCHRNLNHFETYEAC MLACMSGPLA 351 ACSLPALQGP CKAYAPRWAY NSQTGQCQSF VYGGCEGNGNNFESREACEE 401 SCPFPRGNQR CRACKPRQKL VTSFCRSDFV ILGRVSELTE EPDSGRALVT451 VDEVLKDEKM GLKFLGQEPL EVTLLHVDWA CPCPNVTVSE MPLIIMGEVD 501GGMAMLRPDS FVGASSARRV RKLREVMHKK TCDVLKEFLG LHTGGGGSGG 551 GGSGGGGSGGGGSTHTCPPC PAPELLGGPS VFLFPPKPKD TLMISRTPEV 601 TCVVVDVSHE DPEVKFNWYVDGVEVHNAKT KPREEQYNST YRVVSVLTVL 651 HQDWLNGKEY KCKVSNKALP APIEKTISKAKGQPREPQVY TLPPSRKEMT 701 KNQVSLTCLV KGFYPSDIAV EWESNGQPENNYKTTPPVLK SDGSFFLYSK 751 LTVDKSRWQQ GNVFSCSVMH EALHNHYTQK SLSLSPGK (SEQID NO: 58) -13 SNTKVDKRVT GGG   1 THTCPPCPAP ELLGGPSVFL FPPKPKDTLMISRTPEVTCV VVDVSHEDPE  51 VKFNWYVDGV EVHNAKTKPR EEQYNSTYRV VSVLTVLHQDWLNGKEYKCK 101 VSNKALPAPI EKTISKAKGQ PREPQVYTLP PSREEMTKNQ VSLTCLVKGF151 YPSDIAVEWE SNGQPENNYD TTPPVLDSDG SFFLYSDLTV DKSRWQQGNV 201FSCSVMHEAL HNHYTQKSLS LSPGK

Corresponding nucleic acid sequences for the mature (processed) forms ofthese variants are SEQ ID NOs: 59, 60.

(SEQ ID NO: 59) 1 CTGCCGCCCA TCCGCTATTC CCACGCCGGC ATCTGCCCCA ACGACATGAA51 TCCCAACCTC TGGGTGGACG CACAGAGCAC CTGCAGGCGG GAGTGTGAGA 101 CGGACCAGGAGTGTGAGACC TATGAGAAGT GCTGCCCCAA CGTATGTGGG 151 ACCAAGAGCT GCGTGGCGGCCCGCTACATG GACGTGAAAG GGAAGAAGGG 201 CCCAGTGGGC ATGCCCAAGG AGGCCACATGTGACCACTTC ATGTGTCTGC 251 AGCAGGGCTC TGAGTGTGAC ATCTGGGATG GCCAGCCCGTGTGTAAGTGC 301 AAAGACCGCT GTGAGAAGGA GCCCAGCTTT ACCTGCGCCT CGGACGGCCT351 CACCTACTAT AACCGCTGCT ACATGGATGC CGAGGCCTGC TCCAAAGGCA 401TCACACTGGC CGTTGTAACC TGCCGCTATC ACTTCACCTG GCCCAACACC 451 AGCCCCCCACCACCTGAGAC CACCATGCAC CCCACCACAG CCTCCCCAGA 501 GACCCCTGAG CTGGACATGGCGGCCCCTGC GCTGCTCAAC AACCCTGTGC 551 ACCAGTCGGT CACCATGGGT GAGACAGTGAGCTTCCTCTG TGATGTGGTG 601 GGCCGGCCCC GGCCTGAGAT CACCTGGGAG AAGCAGTTGGAGGATCGGGA 651 GAATGTGGTC ATGCGGCCCA ACCATGTGCG TGGCAACGTG GTGGTCACCA701 ACATTGCCCA GCTGGTCATC TATAACGCCC AGCTGCAGGA TGCTGGGATC 751TACACCTGCA CGGCCCGGAA CGTGGCTGGG GTCCTGAGGG CTGATTTCCC 801 GCTGTCGGTGGTCAGGGGTC ATCAGGCTGC AGCCACCTCA GAGAGCAGCC 851 CCAATGGCAC GGCTTTCCCGGCGGCCGAGT GCCTGAAGCC CCCCGACAGT 901 GAGGACTGTG GCGAAGAGCA GACCCGCTGGCACTTCGATG CCCAGGCCAA 951 CAACTGCCTG ACCTTCACCT TCGGCCACTG CCACCGTAACCTCAACCACT 1001 TTGAGACCTA TGAGGCCTGC ATGCTGGCCT GCATGAGCGG GCCGCTGGCC1051 GCGTGCAGCC TGCCCGCCCT GCAGGGGCCC TGCAAAGCCT ACGCGCCTCG 1101CTGGGCTTAC AACAGCCAGA CGGGCCAGTG CCAGTCCTTT GTCTATGGTG 1151 GCTGCGAGGGCAATGGCAAC AACTTTGAGA GCCGTGAGGC CTGTGAGGAG 1201 TCGTGCCCCT TCCCCAGGGGGAACCAGCGC TGTCGGGCCT GCAAGCCTCG 1251 GCAGAAGCTC GTTACCAGCT TCTGTCGCAGCGACTTTGTC ATCCTGGGCC 1301 GAGTCTCTGA GCTGACCGAG GAGCCTGACT CGGGCCGCGCCCTGGTGACT 1351 GTGGATGAGG TCCTAAAGGA TGAGAAAATG GGCCTCAAGT TCCTGGGCCA1401 GGAGCCATTG GAGGTCACTC TGCTTCACGT GGACTGGGCA TGCCCCTGCC 1451CCAACGTGAC CGTGAGCGAG ATGCCGCTCA TCATCATGGG GGAGGTGGAC 1501 GGCGGCATGGCCATGCTGCG CCCCGATAGC TTTGTGGGCG CATCGAGTGC 1551 CCGCCGGGTC AGGAAGCTTCGTGAGGTCAT GCACAAGAAG ACCTGTGACG 1601 TCCTCAAGGA GTTTCTTGGC TTGCACACCGGTGGTGGAGG TTCTGGAGGT 1651 GGAGGAAGTG GTGGAGGTGG TTCTGGAGGT GGTGGAAGTACTCACACATG 1701 CCCACCGTGC CCAGCACCTG AACTCCTGGG GGGACCGTCA GTCTTCCTCT1751 TCCCCCCAAA ACCCAAGGAC ACCCTCATGA TCTCCCGGAC CCCTGAGGTC 1801ACATGCGTGG TGGTGGACGT GAGCCACGAA GACCCTGAGG TCAAGTTCAA 1851 CTGGTACGTGGACGGCGTGG AGGTGCATAA TGCCAAGACA AAGCCGCGGG 1901 AGGAGCAGTA CAACAGCACGTACCGTGTGG TCAGCGTCCT CACCGTCCTG 1951 CACCAGGACT GGCTGAATGG CAAGGAGTACAAGTGCAAGG TCTCCAACAA 2001 AGCCCTCCCA GCCCCCATCG AGAAAACCAT CTCCAAAGCCAAAGGGCAGC 2051 CCCGAGAACC ACAGGTGTAC ACCCTGCCCC CATCCCGGAA GGAGATGACC2101 AAGAACCAGG TCAGCCTGAC CTGCCTGGTC AAAGGCTTCT ATCCCAGCGA 2151CATCGCCGTG GAGTGGGAGA GCAATGGGCA GCCGGAGAAC AACTACAAGA 2201 CCACGCCTCCCGTGCTGAAG TCCGACGGCT CCTTCTTCCT CTATAGCAAG 2251 CTCACCGTGG ACAAGAGCAGGTGGCAGCAG GGGAACGTCT TCTCATGCTC 2301 CGTGATGCAT GAGGCTCTGC ACAACCACTACACGCAGAAG AGCCTCTCCC 2351 TGTCTCCGGG TAAA (SEQ ID NO: 60) 1 AGCAACACCAAGGTGGACAA GAGAGTTACC GGTGGTGGAA CTCACACATG 51 CCCACCGTGC CCAGCACCTGAACTCCTGGG GGGACCGTCA GTCTTCCTCT 101 TCCCCCCAAA ACCCAAGGAC ACCCTCATGATCTCCCGGAC CCCTGAGGTC 151 ACATGCGTGG TGGTGGACGT GAGCCACGAA GACCCTGAGGTCAAGTTCAA 201 CTGGTACGTG GACGGCGTGG AGGTGCATAA TGCCAAGACA AAGCCGCGGG251 AGGAGCAGTA CAACAGCACG TACCGTGTGG TCAGCGTCCT CACCGTCCTG 301CACCAGGACT GGCTGAATGG CAAGGAGTAC AAGTGCAAGG TCTCCAACAA 351 AGCCCTCCCAGCCCCCATCG AGAAAACCAT CTCCAAAGCC AAAGGGCAGC 401 CCCGAGAACC ACAGGTGTACACCCTGCCCC CATCCCGGGA GGAGATGACC 451 AAGAACCAGG TCAGCCTGAC CTGCCTGGTCAAAGGCTTCT ATCCCAGCGA 501 CATCGCCGTG GAGTGGGAGA GCAATGGGCA GCCGGAGAACAACTACGACA 551 CCACGCCTCC CGTGCTGGAC TCCGACGGCT CCTTCTTCCT CTATAGCGAC601 CTCACCGTGG ACAAGAGCAG GTGGCAGCAG GGGAACGTCT TCTCATGCTC 651CGTGATGCAT GAGGCTCTGC ACAACCACTA CACGCAGAAG AGCCTCTCCC 701 TGTCTCCGGGTAAA

The proteins of SEQ ID NO: 57 and SEQ ID NO: 58 may be co-expressed andpurified from a CHO cell line, to give rise to a heteromeric complexcomprising WFIKKN2-Fc.

In another approach to promote the formation of heteromultimer complexesusing asymmetric Fc fusion proteins the Fc domains are altered tointroduce complementary hydrophobic interactions and an additionalintermolecular disulfide bond as illustrated in the polypeptidesequences of SEQ ID NOs: 71 and 72.

(SEQ ID NO: 71) 1 LPPIRYSHAG ICPNDMNPNL WVDAQSTCRR ECETDQECET YEKCCPNVCG51 TKSCVAARYM DVKGKKGPVG MPKEATCDHF MCLQQGSECD IWDGQPVCKC 101 KDRCEKEPSFTCASDGLTYY NRCYMDAEAC SKGITLAVVT CRYHFTWPNT 151 SPPPPETTMH PTTASPETPELDMAAPALLN NPVHQSVTMG ETVSFLCDVV 201 GRPRPEITWE KQLEDRENVV MRPNHVRGNVVVTNIAQLVI YNAQLQDAGI 251 YTCTARNVAG VLRADFPLSV VRGHQAAATS ESSPNGTAFPAAECLKPPDS 301 EDCGEEQTRW HFDAQANNCL TFTFGHCHRN LNHFETYEAC MLACMSGPLA351 ACSLPALQGP CKAYAPRWAY NSQTGQCQSF VYGGCEGNGN NFESREACEE 401SCPFPRGNQR CRACKPRQKL VTSFCRSDFV ILGRVSELTE EPDSGRALVT 451 VDEVLKDEKMGLKFLGQEPL EVTLLHVDWA CPCPNVTVSE MPLIIMGEVD 501 GGMAMLRPDS FVGASSARRVRKLREVMHKK TCDVLKEFLG LHTGGGTHTC 601 PPCPAPELLG GPSVFLFPPK PKDTLMISRTPEVTCVVVDV SHEDPEVKFNW 701 YVDGVEVHNA KTKPREEQYN STYRVVSVLT VLHQDWLNGKEYKCKVSNKAL 801 PAPIEKTISK AKGQPREPQV YTLPPCREEM TKNQVSLWCL VKGFYPSDIAV901 EWESNGQPEN NYKTTPPVLD SDGSFFLYSK LTVDKSRWQQ GNVFSCSVMHE 1001ALHNHYTQKS LSLSPGK

To promote formation of the heterodimer rather than either of thepossible homodimeric complexes, two amino acid substitutions (replacinga serine with a cysteine and a threonine with a trytophan) can beintroduced into the Fc domain of the fusion protein as indicated bydouble underline above. The amino acid sequence of SEQ ID NO: 71 mayoptionally be provided with lysine (K) removed from the C-terminus.

The complementary form of Fc fusion polypeptide (SEQ ID NO: 72) is asfollows and may optionally be provided with lysine (K) removed from theC-terminus.

(SEQ ID NO: 72) -13 SNTKVDKRVT GGG   1 THTCPPCPAP ELLGGPSVFL FPPKPKDTLMISRTPEVTCV VVDVSHEDPE  51 VKFNWYVDGV EVHNAKTKPR EEQYNSTYRV VSVLTVLHQDWLNGKEYKCK 101 VSNKALPAPI EKTISKAKGQ PREPQVCTLP PSREEMTKNQ VSLSCAVKGF151 YPSDIAVEWE SNGQPENNYK TTPPVLDSDG SFFLVSKLTV DKSRWQQGNV 201FSCSVMHEAL HNHYTQKSLS LSPGK

To guide heterodimer formation with the polypeptide of SEQ ID NOs: 71above, four amino acid substitutions can be introduced into the Fcpolypeptide as indicated by double underline above. The amino acidsequence of SEQ ID NO: 72 may optionally be provided with lysine (K)removed from the C-terminus.

The proteins of SEQ ID NO: 71 and SEQ ID NO: 72 may be co-expressed andpurified from a CHO cell line, to give rise to a heteromeric complexcomprising WFIKKN2-Fc.

The present disclosure provides for expression of follistatin-relatedpolypeptides in cells with, for example, one of the following leadersequences:

Native human follistatin leader: (SEQ ID NO: 61)MVRARHQPGGLCLLLLLLCQFMEDRSAQA Native human FLRG leader: (SEQ ID NO: 62)MRPGAPGPLWPLPWGALAWAVGFVSS Native human WFIKKN1 leader: (SEQ ID NO: 63)MPALRPLLPLLLLLRLTSG Native human WFIKKN2 leader: (SEQ ID NO: 64)MWAPRCRRFWSRWEQVAALLLLLLLLGVPPRSLA Tissue plasminogen activator (TPA):(SEQ ID NO: 65) MDAMKRGLCCVLLLCGAVFVSP Honey bee melittin (HBML): (SEQID NO: 66) MKFLVNVALVFMVVYISYIYA

Selected follistatin-related polypeptide variants incorporate the TPAleader and are fused to a G1Fc domain (SEQ ID NO: 39, 40, 41, 42, 43,44, 45, or 46) with or without an optional linker to form a long chain.A short chain comprising a complementary G1Fc domain (SEQ ID NO: 39, 40,41, 42, 43, 44, 45, or 46) that promotes pairing with the long chainalso incorporates the TPA leader. Constructs were coexpressed in COS orCHO cells and purified from conditioned media by filtration and proteinA chromatography. Purity of samples for reporter gene assays wasevaluated by SDS-PAGE and Western blot analysis.

Two variants incorporating native full-length human WFIKKN2 polypeptidewere generated for direct comparison with the single-arm WFIKKN2-hG1Fcfusion protein produced by coexpression. The first variant was adual-arm WFIKKN2-hG1Fc fusion protein and the second variant was asingle-chain WFIKKN2 polypeptide attached at its C-terminus to a His6tag.

Applicants transiently transfected COS cells with constructs encodingsingle-arm WFIKKN2-hG1Fc and dual-arm WFIKKN2-hG1Fc. CHO cells were usedto stably express WFIKKN2-His6. A UCOE™-based construct encodingWFIKKN2-His6 was stably transfected into a CHO cell line, clones wereselected with methotrexate, and any clones that formed colonies werethen pooled. No gene amplification was performed since it is difficultto amplify UCOE™ pools while maintaining stability of expression.Instead of dilution cloning, high-expressing pools were identified andused for generating WFIKKN2-His6.

Purification of Fc-containing constructs was achieved with a variety oftechniques, including, for example, filtration of conditioned media,followed by protein A chromatography, elution with glycine buffer (pH3.0), sample neutralization, and size-exclusion chromatography. Purityof Fc-containing constructs was evaluated by analytical size-exclusionchromatography and SDS-PAGE.

Purification of WFIKKN2-His6 was achieved using a variety of techniques,including, for example, diafiltration of conditioned media, followed bynickel-nitrilotriacetic acid (Qiagen) agarose affinity chromatography,elution with imidazole buffer, and dialysis against PBS. Purity ofsamples was evaluated for all constructs by analytical size-exclusionchromatography and SDS-PAGE. Analysis of mature protein confirmed theexpected N-terminal sequence for WFIKKN2-His6.

Example 2 Potency of a Single-Arm WFIKKN2 Polypeptide Fusion Protein

A reporter gene assay in A204 rhabdomyosarcoma cells was used toevaluate the ability of full-length human WFIKKN2 polypeptide variantsto inhibit myostatin signaling. This assay is based on a humanrhabdomyosarcoma cell line transfected with a pGL3(CAGA)12 reporterplasmid [Dennler et al (1998) EMBO 17:3091-3100] as well as a controlRenilla reporter plasmid (pRL-CMV) to normalize for transfectionefficiency. The CAGA12 motif is present in TGFβ-responsive genes such asplasminogen activator inhibitor type 1, so this vector is of general usefor factors signaling through Smad2 and Smad3.

On the first day of the assay, A204 cells (ATCC® HTB-82) weredistributed in 48-well plates at 10⁵ cells per well and incubatedovernight in McCoy's 5A growth medium (Life Technologies) supplementedwith 10% FBS. All incubations were at 37° C. with 5% CO2 unlessotherwise noted. On the second day, a solution containing 10 μgpGL3(CAGA)12, 0.1 μg pRL-CMV, 30 μl X-tremeGENE 9 (Roche Diagnostics),and 970 μl OptiMEM (Life Technologies) was incubated for 30 minutes atroom temperature prior to adding to assay buffer (McCoy's 5A medium+0.1%bovine serum albumin) and applying to the plated cells (500 μl/well) foran overnight incubation. On the third day, medium was removed, and cellswere incubated for 6 h with a mixture of ligands and inhibitors preparedas described below.

To evaluate the ability of WFIKKN2 constructs to inhibit myostatinsignaling, a serial dilution of each test article (two replicates each)was made in a 48-well plate in assay buffer to a final volume of 200 μl.An equal volume of myostatin (R&D Systems, final concentration of 32ng/ml) in assay buffer was then added. The test solutions were incubatedfor 30 minutes prior to adding 250 μl of this mixture to each well ofthe 48-well plate of transfected A204 cells. After incubation with testsolutions for 6 h, cells were rinsed with phosphate-buffered saline,then lysed with passive lysis buffer (Promega E1941) and storedovernight at −70° C. On the fourth and final day, the plates were warmedto room temperature with gentle shaking. Cell lysates were transferredto a 96-well chemiluminescence plate and analyzed in a luminometer withreagents from a Dual-Luciferase Reporter Assay system (Promega E1980).The luciferase activity of the experimental reporter was normalized tothe luciferase activity obtained with the Renilla control reporter.

WFIKKN2 polypeptide constructs differed markedly in their ability toinhibit signaling by myostatin. As shown in the table below,single-chain WFIKKN2-His6 and single-arm WFIKKN2-G1Fc potently inhibitedmyostatin signaling, with IC₅₀ values in the low nanomolar range. Incontrast, dual-arm WFIKKN2-G1Fc did not show any reduction in myostatinsignaling over the range of concentrations tested, which suggests thatthe IC₅₀ value for this construct would be at least 100 nM. Thus, thepotency of myostatin inhibition with single-arm WFIKKN2-G1Fc wassubstantially higher than that of a G1Fc fusion protein comprising dualWFIKKN2 arms.

Construct IC₅₀ (nM) Half-life in Mouse (h) Single-chain WFIKKN2-His6 1.4~2.5 Single-arm WFIKKN2-G1Fc 2.9 ~110 Dual-arm WFIKKN2-G1Fc >100 ND ND,not determined

Elimination pharmacokinetics of single-chain WFIKKN2-His6 and single-armWFIKKN2-G1Fc were studied in separate experiments conducted in mice.Concentrations of single-chain WFIKKN2-His6 in mouse serum samples aftersubcutaneous administration of a single dose were measured by ELISA witha commercial anti-His6 antibody (Abeam® ab 18184). Concentrations ofsingle-arm WFIKKN2-G1Fc after subcutaneous administration of a singledose were measured by ELISA with a commercial anti-human IgG1 antibody(Binding Site Immunologicals AP006). As indicated in the table above,the half-life of single-chain WFIKKN2-His6 protein in mice wasapproximately 2.5 hours (data not shown), whereas the half-life ofsingle-arm WFIKKN2-G1Fc fusion protein in mice (n=3) was more than 100hours (data not shown), which would predict that this molecule will havea pharmacologically useful serum half-life in humans of 10-20 days.Thus, by utilizing a heterodimeric or asymmetric approach to generatinga single-arm WFIKKN2 construct, applicants were able to combine thedesirable ligand binding activity of the single chain (native) proteinwith the desirable serum half-life of a traditional homodimeric Fcfusion protein.

Together, these results indicate that single-arm WFIKKN2-G1Fc could be auseful therapeutic agent. Beyond this example, applicants predict thatother asymmetric fusion proteins comprising single-armfollistatin-related polypeptides will also display more potentinhibition of ligand signaling than their dual-arm counterparts whileconferring ease of purification and a serum half-life that is typicalfor a homodimeric Fc fusion protein construct.

INCORPORATION BY REFERENCE

All publications and patents mentioned herein are hereby incorporated byreference in their entirety as if each individual publication or patentwas specifically and individually indicated to be incorporated byreference.

While specific embodiments of the subject matter have been discussed,the above specification is illustrative and not restrictive. Manyvariations will become apparent to those skilled in the art upon reviewof this specification and the claims below. The full scope of theinvention should be determined by reference to the claims, along withtheir full scope of equivalents, and the specification, along with suchvariations.

1. A protein complex comprising a first polypeptide covalently ornon-covalently associated with a second polypeptide, wherein: a. thefirst polypeptide comprises the amino acid sequence of afollistatin-related polypeptide and the amino acid sequence of a firstmember of an interaction pair; and b. the second polypeptide comprisesthe amino acid sequence of a second member of the interaction pair, andwherein the second polypeptide does not comprise a follistatin-relatedpolypeptide.
 2. The protein complex of claim 1, wherein the amino acidsequence of a follstatin-related polypeptide comprises an amino acidsequence that is at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100%identical to the amino acid sequence of any of SEQ ID Nos: 1-33.
 3. Theprotein complex of claim 1, wherein the first polypeptide comprises alinker polypeptide positioned between the amino acid sequence of thefollistatin-related polypeptide and the amino acid sequence of the firstmember of the interaction pair.
 4. The protein complex of claim 1,wherein the first member of the interaction pair comprises a constantdomain of an immunoglobulin and wherein the second member of theinteraction pair comprises a constant domain of an immunoglobulin. 5-9.(canceled)
 10. The protein complex of claim 1, wherein the first memberof the interaction pair associates covalently or non-covalently with thesecond member of the interaction pair to form a dimeric protein complex.11. The protein complex of claim 1, wherein the interaction pair is anasymmetric interaction pair.
 12. The protein complex of claim 11,wherein the first member of the asymmetric interaction pair comprises afirst modified constant domain of an IgG and wherein the second memberof the asymmetric interaction pair comprises a second modified constantdomain of an IgG, and wherein the first and second members of theasymmetric interaction pair associate to form a heterodimeric complex.13. The protein complex of claim 11, wherein the first member of theasymmetric interaction pair comprises a first modified Fc portion of IgGand wherein the second member of the asymmetric interaction paircomprises a second modified Fc portion of an IgG, and wherein the firstand second members of the asymmetric interaction pair associate to forma heterodimeric complex.
 14. The protein complex of claim 13, whereinthe first Fc portion of an IgG comprises an amino acid sequence that isat least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to anamino acid sequence selected from the group: SEQ ID NOs 34-46, andwherein the second Fc portion of an IgG comprises an amino acid sequencethat is different from the amino acid sequence of the first Fc portionof an IgG and that is at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or100% identical to an amino acid sequence selected from the group SEQ IDNOs 34-46. 15-30. (canceled)
 31. The protein complex of claim 1, whereinthe interaction pair is an unguided interaction pair.
 32. The proteincomplex of claim 31, wherein the first member of the unguidedinteraction pair associates covalently or non-covalently with the secondmember of the interaction pair to form a dimeric complex.
 33. Theprotein complex of claim 31, wherein the first member of the unguidedinteraction pair has the same amino acid sequence as the second memberof the unguided interaction pair.
 34. The protein complex of claim 1,wherein the second polypeptide comprises the amino acid sequence of asecond member of the interaction pair, and wherein the secondpolypeptide does not comprise any other amino acid sequence that confersa substantial biological activity.
 35. The protein complex of claim 1,wherein the second polypeptide consists of the amino acid sequence of asecond member of the interaction pair, provided that the secondpolypeptide may comprise an additional 1-50, 1-40, 1-30, 1-20 or 1-10amino acids fused to the C-terminus, the N-terminus or both the C- andN-termini of the amino acid sequence of the second member of theinteraction pair.
 36. The protein complex of claim 35, wherein theadditional amino acids confer no substantial biological activity. 37.The protein complex of claim 1, wherein the protein complex binds to oneor more ligands selected from: GDF8, GDF11, activin A, activin B,activin C or activin E with a K_(D) of greater than or equal to 10⁻⁹,10⁻¹⁰, 10⁻¹¹, or 10⁻¹².
 38. The protein complex of claim 1, wherein theprotein complex inhibits in a cell-based assay signaling by one or moreligands selected from: GDF8, GDF11, activin A, activin B, activin C oractivin E.
 39. The protein complex of claim 1, wherein the proteincomplex exhibits a serum half-life in a mouse of at least 6, 12, 24, 36,48 or 72 hours.
 40. The protein complex of claim 1, wherein the proteincomplex exhibits a serum half-life in a human of at least 6, 8, 10, or12 days.
 41. The protein complex of claim 1, wherein the protein complexis a heterodimer.
 42. A pharmaceutical preparation comprising a proteincomplex of claim
 1. 43. The pharmaceutical preparation of claim 42,wherein the protein complex is at least 80%, 85%, 90%, 95%, 97%, 98% or99% pure with respect to other polypeptide components.
 44. Thepharmaceutical preparation of claim 42, wherein the pharmaceuticalpreparation comprises less than 20%, 15%, 10%, 5%, 3%, 2% or 1% ofhomodimers formed by the self-association of the first or secondpolypeptides.
 45. A fusion polypeptide comprising an amino acid sequenceof a follistatin-related polypeptide and the amino acid sequence of amember of an asymmetric interaction pair.
 46. The fusion polypeptide ofclaim 45, wherein the amino acid sequence of the follstatin-relatedpolypeptide comprises an amino acid sequence that is at least 80%, 85%,90%, 95%, 96%, 97%, 98%, 99% or 100% identical to the amino acidsequence of any of SEQ ID Nos: 1-33.
 47. The fusion polypeptide of claim45, wherein the fusion polypeptide comprises a linker polypeptidepositioned between the amino acid sequence of the follistatin-relatedpolypeptide and the amino acid sequence of the member of the asymmetricinteraction pair.
 48. The fusion polypeptide of claim 45, wherein themember of the asymmetric interaction pair comprises a constant domain ofan immunoglobulin. 49-50. (canceled)
 51. The fusion polypeptide of claim45, wherein the member of the asymmetric interaction pair comprises anamino acid sequence that is at least 80%, 85%, 90%, 95%, 97%, 98%, 99%or 100% identical to any of SEQ ID NOs: 34-46.
 52. (canceled)
 53. Theprotein complex of claim 1, wherein: a. the first polypeptide comprisesan amino acid sequence that is at least 80%, 85%, 90%, 95%, 97%, 98%,99% or 100% to the amino acid sequence of SEQ ID NO: 57; and b. thesecond polypeptide comprises an amino acid sequence that is at least80%, 85%, 90%, 95%, 97%, 98%, 99% or 100% to the amino acid sequence ofSEQ ID NO:
 58. 54. The protein complex of claim 1, wherein: a. the firstpolypeptide comprises an amino acid sequence that is at least 80%, 85%,90%, 95%, 97%, 98%, 99% or 100% to the amino acid sequence of SEQ ID NO:71; and b. the second polypeptide comprises an amino acid sequence thatis at least 80%, 85%, 90%, 95%, 97%, 98%, 99% or 100% to the amino acidsequence of SEQ ID NO:
 72. 55. A nucleic acid comprising a codingsequence for the fusion protein of claim 45.